Difference between revisions of "IU:TestPage"

From IU
Jump to navigation Jump to search
Line 1: Line 1:
   
  +
= IT Product Development =
= Practical Machine Learning and Deep Learning =
 
* '''Course name''': Practical Machine Learning and Deep Learning
+
* '''Course name''': IT Product Development
* '''Code discipline''':
+
* '''Code discipline''': CSE807
  +
* '''Subject area''': Software Engineering
* '''Subject area''': Practical aspects of deep learning (DL); Practical applications of DL in Natural Language Processing, Computer Vision and generation.
 
   
 
== Short Description ==
 
== Short Description ==
  +
This course has two parts: 1) building and launching a user-facing software product with the special emphasis on understanding user needs and 2) the application of data-driven product development techniques to iteratively improve the product. Students will learn how to transform an idea into software requirements through user research, prototyping and usability tests, then they will proceed to launch the MVP version of the product. In the second part of the course, the students will apply an iterative data-driven approach to developing a product, integrate event analytics, and run controlled experiments.
 
   
 
== Prerequisites ==
 
== Prerequisites ==
   
 
=== Prerequisite subjects ===
 
=== Prerequisite subjects ===
  +
* CSE101: Introduction to Programming
* CSE202 — Analytical Geometry and Linear Algebra I / []: Manifolds "Linear Alg./Calculus: Manifolds
 
* CSE203 Mathematical Analysis II: Basics of optimisation
+
* CSE112: Software Systems Analysis and Design
  +
* CSE122 OR CSE804 OR CSE809 OR CSE812
* CSE201 — Mathematical Analysis I: integration and differentiation.
 
* CSE103 — Theoretical Computer Science: Graph theory basics, Spectral decomposition.
 
* CSE206 — Probability And Statistics: Multivariate normal dist.
 
* CSE504 — Digital Signal Processing: convolution, cross-correlation"
 
   
 
=== Prerequisite topics ===
 
=== Prerequisite topics ===
  +
* Basic programming skills.
 
  +
* OOP, and software design.
  +
* Familiarity with some development framework or technology (web or mobile)
   
 
== Course Topics ==
 
== Course Topics ==
Line 27: Line 26:
 
! Section !! Topics within the section
 
! Section !! Topics within the section
 
|-
 
|-
| Review. CNNs and RNNs ||
+
| From idea to MVP ||
  +
# Introduction to Product Development
# Image processing, FFNs, CNNs
 
  +
# Exploring the domain: User Research and Customer Conversations
# Training Deep NNs
 
  +
# Documenting Requirements: MVP and App Features
# RNNs, LSTM, GRU, Embeddings
 
  +
# Prototyping and usability testing
# Bidirectional RNNs
 
# Seq2seq
 
# Encoder-Decoder Networks
 
# Attention
 
# Memory Networks
 
 
|-
 
|-
  +
| Development and Launch ||
| Team Data Science Processes ||
 
  +
# Product backlog and iterative development
# Team Data Science Processes
 
  +
# Estimation Techniques, Acceptance Criteria, and Definition of Done
# Team Data Science Roles
 
  +
# UX/UI Design
# Team Data Science Tools (MLFlow, KubeFlow)
 
  +
# Software Engineering vs Product Management
# CRISP-DM
 
# Productionizing ML systems
 
 
|-
 
|-
  +
| Hypothesis-driven development ||
| VAEs, GANs ||
 
  +
# Hypothesis-driven product development
# Autoencoders
 
  +
# Measuring a product
# Variational Autoencoders
 
  +
# Controlled Experiments and A/B testing
# GANs, DCGAN
 
|}
 
== Intended Learning Outcomes (ILOs) ==
 
 
=== What is the main purpose of this course? ===
 
The course is about the practical aspects of deep learning. In addition to frontal lectures, the flipped classes and student project presentations will be organized. During lab sessions the working language is Python. The primary framework for deep learning is PyTorch. Usage of TensorFlow and Keras is possible, usage of Docker is highly appreciated.
 
 
=== ILOs defined at three levels ===
 
 
==== Level 1: What concepts should a student know/remember/explain? ====
 
By the end of the course, the students should be able to ...
 
* to apply deep learning methods to effectively solve practical (real-world) problems;
 
* to work in data science team;
 
* to understand of principles and a lifecycle of data science projects.
 
 
==== Level 2: What basic practical skills should a student be able to perform? ====
 
By the end of the course, the students should be able to ...
 
* to understand modern deep NN architectures;
 
* to compare modern deep NN architectures;
 
* to create a prototype of a data-driven product.
 
 
==== Level 3: What complex comprehensive skills should a student be able to apply in real-life scenarios? ====
 
By the end of the course, the students should be able to ...
 
* to apply techniques for efficient training of deep NNs;
 
* to apply methods for data science team organisation;
 
* to apply deep NNs in NLP and computer vision.
 
== Grading ==
 
 
=== Course grading range ===
 
{| class="wikitable"
 
|+
 
|-
 
! Grade !! Range !! Description of performance
 
|-
 
| A. Excellent || 90-100 || -
 
|-
 
| B. Good || 75-89 || -
 
|-
 
| C. Satisfactory || 60-74 || -
 
|-
 
| D. Poor || 0-59 || -
 
 
|}
 
|}
 
=== Course activities and grading breakdown ===
 
{| class="wikitable"
 
|+
 
|-
 
! Activity Type !! Percentage of the overall course grade
 
|-
 
| Labs/seminar classes || 20
 
|-
 
| Interim performance assessment || 30
 
|-
 
| Exams || 50
 
|}
 
 
=== Recommendations for students on how to succeed in the course ===
 
 
 
== Resources, literature and reference materials ==
 
 
=== Open access resources ===
 
* Goodfellow et al. Deep Learning, MIT Press. 2017
 
* Géron, Aurélien. Hands-On Machine Learning with Scikit-Learn and TensorFlow: Concepts, Tools, and Techniques to Build Intelligent Systems. 2017.
 
* Osinga, Douwe. Deep Learning Cookbook: Practical Recipes to Get Started Quickly. O’Reilly Media, 2018.
 
 
=== Closed access resources ===
 
 
 
=== Software and tools used within the course ===
 
 
= Teaching Methodology: Methods, techniques, & activities =
 
 
== Activities and Teaching Methods ==
 
{| class="wikitable"
 
|+ Activities within each section
 
|-
 
! Learning Activities !! Section 1 !! Section 2 !! Section 3
 
|-
 
| Development of individual parts of software product code || 1 || 1 || 1
 
|-
 
| Homework and group projects || 1 || 1 || 1
 
|-
 
| Midterm evaluation || 1 || 1 || 1
 
|-
 
| Testing (written or computer based) || 1 || 1 || 1
 
|-
 
| Discussions || 1 || 1 || 1
 
|}
 
== Formative Assessment and Course Activities ==
 
 
=== Ongoing performance assessment ===
 
 
==== Section 1 ====
 
{| class="wikitable"
 
|+
 
|-
 
! Activity Type !! Content !! Is Graded?
 
|-
 
| Question || Suppose you use Batch Gradient Descent and you plot the validation error at every epoch. If you notice that the validation error consistently goes up, what is likely going on? How can you fix this? || 1
 
|-
 
| Question || Is it a good idea to stop Mini-batch Gradient Descent immediately when the validation error goes up? || 1
 
|-
 
| Question || List the optimizers that you know (except SGD) and explain one of them || 1
 
|-
 
| Question || Describe Xavier (or Glorot) initialization. Why do you need it? || 1
 
|-
 
| Question || Name advantages of the ELU activation function over ReLU. || 0
 
|-
 
| Question || Can you name the main innovations in AlexNet, compared to LeNet-5? What about the main innovations in GoogLeNet and ResNet? || 0
 
|-
 
| Question || What is the difference between LSTM and GRU cells? || 0
 
|}
 
==== Section 2 ====
 
{| class="wikitable"
 
|+
 
|-
 
! Activity Type !! Content !! Is Graded?
 
|-
 
| Question || What is CRISP-DM? || 1
 
|-
 
| Question || What is TDSP? || 1
 
|-
 
| Question || How to use MLflow? || 1
 
|-
 
| Question || What is TensorBoard? || 1
 
|-
 
| Question || How to apply Kubeflow in practice? || 1
 
|-
 
| Question || Explain issues in distributed learning of deep NNs. || 0
 
|-
 
| Question || How do you organize your data science project? || 0
 
|-
 
| Question || Recall a checklist for organization of a typical data science project. || 0
 
|}
 
==== Section 3 ====
 
{| class="wikitable"
 
|+
 
|-
 
! Activity Type !! Content !! Is Graded?
 
|-
 
| Question || What is an Autoencoder? Can you list the structure and types of Autoencoders? || 1
 
|-
 
| Question || Can you describe ways to train Stacked AEs? || 1
 
|-
 
| Question || What is Denoising AE? Can you describe what is sparsity loss and why it can be useful? || 1
 
|-
 
| Question || Can you make a distinction between AE and VAE? || 1
 
|-
 
| Question || If an autoencoder perfectly reconstructs the inputs, is it necessarily a good autoencoder? How can you evaluate the performance of an autoencoder? || 0
 
|-
 
| Question || How do you tie weights in a stacked autoencoder? What is the point of doing so? || 0
 
|-
 
| Question || What about the main risk of an overcomplete autoencoder? || 0
 
|-
 
| Question || How the loss function for VAE is defined? What is ELBO? || 0
 
|-
 
| Question || Can you list the structure and types of a GAN? || 0
 
|-
 
| Question || How would you train a GAN? || 0
 
|-
 
| Question || How would you estimate the quality of a GAN? || 0
 
|-
 
| Question || Can you describe cost function of a Discriminator? || 0
 
|}
 
=== Final assessment ===
 
'''Section 1'''
 
# Explain what the Teacher Forcing is.
 
# Why do people use encoder–decoder RNNs rather than plain sequence-to-sequence RNNs for automatic translation?
 
# How could you combine a convolutional neural network with an RNN to classify videos?
 
'''Section 2'''
 
# Can you explain what it means for a company to be ML-ready?
 
# What a company can do to become ML-ready / Data driven?
 
# Can you list approaches to structure DS-teams? Discuss their advantages and disadvantages.
 
# Can you list and define typical roles in a DS team?
 
# What do you think about practical aspects of processes and roles in Data Science projects/teams?
 
'''Section 3'''
 
# Can you make a distinction between Variational approximation of density and MCMC methods for density estimation?
 
# What is DCGAN? What is its purpose? What are main features of DCGAN?
 
# What is your opinion about Word Embeddings? What types do you know? Why are they useful?
 
# How would you classify different CNN architectures?
 
# How would you classify different RNN architectures?
 
# Explain attention mechanism. What is self-attention?
 
# Explain the Transformer architecture. What is BERT?
 
 
=== The retake exam ===
 
'''Section 1'''
 
 
'''Section 2'''
 
 
'''Section 3'''
 

Revision as of 17:18, 20 April 2022

IT Product Development

  • Course name: IT Product Development
  • Code discipline: CSE807
  • Subject area: Software Engineering

Short Description

This course has two parts: 1) building and launching a user-facing software product with the special emphasis on understanding user needs and 2) the application of data-driven product development techniques to iteratively improve the product. Students will learn how to transform an idea into software requirements through user research, prototyping and usability tests, then they will proceed to launch the MVP version of the product. In the second part of the course, the students will apply an iterative data-driven approach to developing a product, integrate event analytics, and run controlled experiments.

Prerequisites

Prerequisite subjects

  • CSE101: Introduction to Programming
  • CSE112: Software Systems Analysis and Design
  • CSE122 OR CSE804 OR CSE809 OR CSE812

Prerequisite topics

  • Basic programming skills.
  • OOP, and software design.
  • Familiarity with some development framework or technology (web or mobile)

Course Topics

Course Sections and Topics
Section Topics within the section
From idea to MVP
  1. Introduction to Product Development
  2. Exploring the domain: User Research and Customer Conversations
  3. Documenting Requirements: MVP and App Features
  4. Prototyping and usability testing
Development and Launch
  1. Product backlog and iterative development
  2. Estimation Techniques, Acceptance Criteria, and Definition of Done
  3. UX/UI Design
  4. Software Engineering vs Product Management
Hypothesis-driven development
  1. Hypothesis-driven product development
  2. Measuring a product
  3. Controlled Experiments and A/B testing