Difference between revisions of "IU:TestPage"
R.sirgalina (talk | contribs) |
R.sirgalina (talk | contribs) |
||
Line 1: | Line 1: | ||
+ | = Big Data Technologies and Analytics = |
||
− | = Theoretical Mechanics = |
||
− | * '''Course name''': |
+ | * '''Course name''': Big Data Technologies and Analytics |
− | * '''Code discipline''': |
+ | * '''Code discipline''': N/A |
− | * '''Subject area''': |
+ | * '''Subject area''': P.1 Short Description |
== Short Description == |
== Short Description == |
||
− | This course covers the following concepts: |
+ | This course covers the following concepts: Advanced distributed data organization; Advanced distributed data processing. |
== Prerequisites == |
== Prerequisites == |
||
=== Prerequisite subjects === |
=== Prerequisite subjects === |
||
+ | |||
− | * CSE203 — Mathematical Analysis II: Linear algebra, vectors and matrices, partial derivatives. |
||
− | * CSE205 — Differential Equations: ODE. |
||
=== Prerequisite topics === |
=== Prerequisite topics === |
||
Line 23: | Line 22: | ||
! Section !! Topics within the section |
! Section !! Topics within the section |
||
|- |
|- |
||
− | | |
+ | | Introduction || |
+ | # What is Big Data |
||
− | # Introduction to theoretical mechanics |
||
− | # |
+ | # Characteristics of Big Data |
+ | # Technologies |
||
− | # Translatory and rotational motion of a rigid body |
||
+ | # Virtualization and cloud computing |
||
− | # Plane motion of a rigid body |
||
+ | |- |
||
− | # Spherical motion of a rigid body |
||
+ | | File systems and resource managers || |
||
− | # Motion of a free rigid body |
||
+ | # HDFS |
||
− | # Resultant motion |
||
+ | # YARN |
||
|- |
|- |
||
− | | |
+ | | Batch Processing || |
+ | # Distributed batch processing |
||
− | # Basic concepts and principles of Statics |
||
+ | # MapReduce model |
||
− | # Parallel forces and couples |
||
+ | # Applications |
||
− | # Equilibrium of a rigid body system in 2D |
||
+ | # Tasks management |
||
− | # Equilibrium of a rigid body system in 3D |
||
+ | # Patterns |
||
− | # Friction |
||
− | # Center of gravity |
||
|- |
|- |
||
− | | |
+ | | Stream Processing || |
+ | # CAP theorem |
||
− | # Particle dynamics |
||
+ | # Distributed storage and computation |
||
− | # Theorem of the motion of the center of mass of a system |
||
+ | # Distributed Stream Processing |
||
− | # Theorem of the change in the linear momentum of a system |
||
+ | # Usage patterns |
||
− | # Theorem of the change in the angular momentum of a system |
||
− | # Some cases of rigid body motion. |
||
− | # D’Alambert’s principle |
||
− | # Mechanical work and power |
||
− | # Theorem of the change in the kinetic energy of a system |
||
− | # The theory of impact |
||
− | # Oscillations |
||
|- |
|- |
||
− | | |
+ | | Analytics || |
+ | # Architecture |
||
− | # Constraints and their classification |
||
+ | # Use cases |
||
− | # Generalized coordinates |
||
+ | # SparkML |
||
− | # Generalized forces |
||
+ | # GraphX |
||
− | # The D’Alembert-Lagrange’s principle |
||
− | # The principle of virtual work |
||
− | # The General Equation of dynamics |
||
− | # Lagrange’s equations |
||
− | # The Hamilton’s equations |
||
|} |
|} |
||
== Intended Learning Outcomes (ILOs) == |
== Intended Learning Outcomes (ILOs) == |
||
=== What is the main purpose of this course? === |
=== What is the main purpose of this course? === |
||
+ | Nowadays companies need to manage vast amounts of data on a daily basis. Storing, sorting, accessing and analyzing obtaining synthetic information is considered one of the great challenges of the 21st century and and being effective in this may make the difference between success and failure. In order to gain a competitive advantage, Big Data and Analytics professionals are able to extract useful information from data and increase the Return Of Investments. In this course, students will be exposed to the key technologies and techniques, including R and Apache Spark, in order to analyze large-scale data sets and uncover valuable business information. |
||
− | The purpose of the course is to give basic and advanced knowledge on theoretical mechanics. The course covers kinematics of a particle and a rigid body, statics of rigid bodies, particle dynamics, dynamics of a system, analytical mechanics. The objective of the course is to give knowledge and skills which can be used further for calculating of kinematics, statics and dynamics of mechanical parts of robots and studying advanced courses on robotics. |
||
=== ILOs defined at three levels === |
=== ILOs defined at three levels === |
||
Line 71: | Line 60: | ||
==== Level 1: What concepts should a student know/remember/explain? ==== |
==== Level 1: What concepts should a student know/remember/explain? ==== |
||
By the end of the course, the students should be able to ... |
By the end of the course, the students should be able to ... |
||
+ | * Understanding of big data applications. |
||
− | * Methods for describing the laws of motion of a particle and a solid, |
||
+ | * Algorithms for the statistical analysis of big data |
||
− | * Methods for calculating the speeds and accelerations of points and bodies included in a mechanical system, |
||
+ | * Fundamental principles of predictive analytics |
||
− | * Methods for studying the equilibrium of mechanical systems, |
||
− | * Methods for creating differential equations of motion of a particle and a solid, |
||
− | * Methods for creating differential equations of motion of a mechanical system based on the classical approach, |
||
− | * Methods for creating differential equations of motion of a mechanical system based on methods of analytical mechanics. |
||
==== Level 2: What basic practical skills should a student be able to perform? ==== |
==== Level 2: What basic practical skills should a student be able to perform? ==== |
||
By the end of the course, the students should be able to ... |
By the end of the course, the students should be able to ... |
||
− | * How to |
+ | * How to process batch data |
+ | * How to process stream data |
||
− | * What calculation methods can be used to solve a specific problem, |
||
+ | * Advanced design of distributed architectures |
||
− | * What calculation methods are appropriate to use when solving a specific problem, |
||
+ | * Advanced design of distributed algorithms |
||
− | * What limitations and errors are imposed by a specific method when solving a problem. |
||
==== Level 3: What complex comprehensive skills should a student be able to apply in real-life scenarios? ==== |
==== Level 3: What complex comprehensive skills should a student be able to apply in real-life scenarios? ==== |
||
By the end of the course, the students should be able to ... |
By the end of the course, the students should be able to ... |
||
+ | * Write a program for batch processing |
||
− | * Analyze and explain mechanical phenomena based on the laws and theorems of theoretical mechanics, |
||
+ | * Write a program for stream processing |
||
− | * Apply the basic laws and methods of theoretical mechanics to solving technical problems, |
||
+ | * Design distributed processing pipelines |
||
− | * Create mathematical models, evaluate their value and the relativity of their limits of application. |
||
+ | * Desing distributed algorithms |
||
== Grading == |
== Grading == |
||
Line 113: | Line 100: | ||
! Activity Type !! Percentage of the overall course grade |
! Activity Type !! Percentage of the overall course grade |
||
|- |
|- |
||
− | | Labs/seminar classes || |
+ | | Labs/seminar classes || 30 |
|- |
|- |
||
− | | Interim performance assessment || |
+ | | Interim performance assessment || 30 |
|- |
|- |
||
| Exams || 40 |
| Exams || 40 |
||
Line 126: | Line 113: | ||
=== Open access resources === |
=== Open access resources === |
||
− | * |
+ | * Slides and material provided during the course. |
+ | * F. Provost and T. Fawcett. Data Science for Business. O’Reilly, 2013 |
||
− | * D. Deleanu Theoretical mechanics. Theory and applications / Dumitru Deleanu – Constanta: Nautica, 2012 |
||
+ | * Matthew North. Data Mining for the Masses, Second Edition: with implementations in RapidMiner and R. CreateSpace Independent Publishing Platform, 2012 |
||
− | * Stephen T. Thornton and Jerry B. Marion Classical Dynamics of Particles and Systems. 5th edition, 2004 |
||
+ | * Tom White. Hadoop: The Definitive Guide. O’Reilly Media, Inc., 2012 |
||
− | * Meshchersky I.V. Collection of Problems in Theoretical Mechanics 2014 |
||
+ | * Seema Acharya and Subhashini Chellappan. Big data and analytics. WileyIndia, 2016 |
||
− | * Prof . Dr. Ing. Vasile Szolga Theoretical Mechanics, 2010 |
||
− | * S.M. Targ Kratki kurs teoreticheskoi mechaniki, 1986 - in Russian |
||
− | * A.I. Lurie Analiticheskaya mechanika, 1961 - in Russian |
||
− | * Sbornik kursovych rabot po teoreticheskoi mechanike. A.A.Yablonski, 2000 - in Russian |
||
− | * Meshchersky I.V. Sbornik zadach po teoreticheskoi mechanike, 1986 - in Russian |
||
=== Closed access resources === |
=== Closed access resources === |
||
Line 147: | Line 130: | ||
|+ Activities within each section |
|+ Activities within each section |
||
|- |
|- |
||
− | ! Learning Activities !! Section 1 !! Section 2 !! Section 3 !! Section 4 |
+ | ! Learning Activities !! Section 1 !! Section 2 !! Section 3 !! Section 4 !! Section 5 |
|- |
|- |
||
− | | |
+ | | Testing (written or computer based) || 1 || 1 || 1 || 1 || 1 |
|- |
|- |
||
− | | |
+ | | Discussions || 1 || 1 || 1 || 1 || 1 |
|- |
|- |
||
− | | |
+ | | Development of individual parts of software product code || 0 || 1 || 1 || 1 || 1 |
+ | |- |
||
+ | | Homework and group projects || 0 || 1 || 1 || 1 || 1 |
||
+ | |- |
||
+ | | Midterm evaluation || 0 || 1 || 1 || 1 || 1 |
||
|} |
|} |
||
== Formative Assessment and Course Activities == |
== Formative Assessment and Course Activities == |
||
Line 165: | Line 152: | ||
! Activity Type !! Content !! Is Graded? |
! Activity Type !! Content !! Is Graded? |
||
|- |
|- |
||
+ | | Question || Describe the 6 Vs || 1 |
||
− | | Question || Calculate of the kinematic parameters of the particle according to the given laws of motion, it is required to determine: <br> particle trajectory, <br> particle velocity, <br> particle acceleration and its normal and tangential components, <br> radius of curvature of the trajectory. || 1 |
||
|- |
|- |
||
+ | | Question || Describe the technologies to support big data || 1 |
||
− | | Question || Calculate of the kinematic parameters of the planar mechanism, determine: <br> velocity of specific points of the mechanism and angular velocity of the links of the mechanism using the method of instantaneous velocity centers, <br> velocity of specific points of the mechanism and angular velocity of the links of the mechanism using the analytical method, <br> acceleration of specific points of the mechanism and angular accelerations of the links of the mechanism. || 1 |
||
|- |
|- |
||
+ | | Question || Design the structure of a cloud architecture for big data || 0 |
||
− | | Question || Calculate of the kinematics of the complex motion of a point, determine: <br> transport, relative and absolute velocity of the point, <br> transport, relative, Coriolis and absolute acceleration of the point. || 1 |
||
|- |
|- |
||
+ | | Question || Give examples of the 6 Vs in real systems || 0 |
||
− | | Question || Calculate of the kinematics of gears, determine the gear ratio, angular velocities and angular accelerations of links, velocities and accelerations of specific points of links for: <br> gearbox with fixed axles, <br> planetary gearbox with parallel axes, <br> planetary gearbox with intersecting axes. || 1 |
||
− | |- |
||
− | | Question || Make a synthesis of the laws of motion of a point and a solid, taking into account given conditions and restrictions. || 0 |
||
− | |- |
||
− | | Question || Do a kinematic analysis of complex planar mechanisms with a large number of links. || 0 |
||
− | |- |
||
− | | Question || Do a kinematic analysis of complex planar mechanisms with several degrees of freedom. || 0 |
||
− | |- |
||
− | | Question || Do a kinematic analysis of spatial mechanisms. || 0 |
||
− | |- |
||
− | | Question || Do a kinematic analysis of the complex motion of a solid body. || 0 |
||
|} |
|} |
||
==== Section 2 ==== |
==== Section 2 ==== |
||
Line 189: | Line 166: | ||
! Activity Type !! Content !! Is Graded? |
! Activity Type !! Content !! Is Graded? |
||
|- |
|- |
||
− | | Question || |
+ | | Question || Describe the characteristics of the different nodes of HDFS || 1 |
|- |
|- |
||
− | | Question || |
+ | | Question || How files and blocks are managed || 1 |
|- |
|- |
||
− | | Question || |
+ | | Question || Describe the resource manager || 1 |
|- |
|- |
||
− | | Question || |
+ | | Question || Describe the lifecycle of an application || 1 |
|- |
|- |
||
− | | Question || |
+ | | Question || Describe and compare the scheduling approaches || 1 |
|- |
|- |
||
− | | Question || |
+ | | Question || Configure a HDFS cluster || 0 |
|- |
|- |
||
+ | | Question || Build a HDFS client || 0 |
||
− | | Question || Apply equilibrium equations for calculating the reactions of supports of a system of bodies in 2D. || 0 |
||
|- |
|- |
||
− | | Question || |
+ | | Question || Use a HDFS command line || 0 |
|- |
|- |
||
− | | Question || |
+ | | Question || Configure YARN || 0 |
+ | |- |
||
+ | | Question || Evaluate the overall performance of YARN || 0 |
||
|} |
|} |
||
==== Section 3 ==== |
==== Section 3 ==== |
||
Line 213: | Line 192: | ||
! Activity Type !! Content !! Is Graded? |
! Activity Type !! Content !! Is Graded? |
||
|- |
|- |
||
− | | Question || |
+ | | Question || Describe the MapReduce model || 1 |
|- |
|- |
||
+ | | Question || Describe tasks management || 1 |
||
− | | Question || Derive and solve differential equations based on the theorem on the motion of the center of mass of a system. || 1 |
||
|- |
|- |
||
+ | | Question || Describe patterns of usage || 1 |
||
− | | Question || Derive and solve differential equations based on the theorem on the change in the angular momentum of a system. || 1 |
||
|- |
|- |
||
+ | | Question || Solve with MapReduce a specific problem || 0 |
||
− | | Question || Derive and solve the differential equations of rectilinear and curvilinear motion of bodies that form a system with one degree of freedom. || 1 |
||
|- |
|- |
||
+ | | Question || Implement a usage pattern || 0 |
||
− | | Question || Derive and solve differential equations of motion based on the D’Alembert’s principle. || 1 |
||
+ | |} |
||
+ | ==== Section 4 ==== |
||
+ | {| class="wikitable" |
||
+ | |+ |
||
|- |
|- |
||
+ | ! Activity Type !! Content !! Is Graded? |
||
− | | Question || Derive and solve the differential equation based on the theorem on the change in kinetic energy. || 1 |
||
|- |
|- |
||
+ | | Question || Analyze the CAP theorem || 1 |
||
− | | Question || Apply the differential equations of a particle to study the motion of a body in a field of gravity under the influence of air resistance. || 0 |
||
|- |
|- |
||
− | | Question || |
+ | | Question || Define the kinds of data storage available || 1 |
|- |
|- |
||
+ | | Question || Characteristics of stream processing || 1 |
||
− | | Question || Apply the theorem on the motion of the center of mass of a system to determine the dynamic reactions of the support of the mechanism. || 0 |
||
|- |
|- |
||
+ | | Question || Describe the usage patterns || 1 |
||
− | | Question || Apply the theorem on the change in the angular momentum of a system to study the gyroscopic effect. || 0 |
||
|- |
|- |
||
− | | Question || |
+ | | Question || Build a program to solve a problem with stream processing || 0 |
|- |
|- |
||
+ | | Question || Interact with a NoSQL database || 0 |
||
− | | Question || Apply the kinetic energy change theorem to determine the velosity of bodies of a mechanical system. || 0 |
||
|} |
|} |
||
− | ==== Section |
+ | ==== Section 5 ==== |
{| class="wikitable" |
{| class="wikitable" |
||
|+ |
|+ |
||
Line 243: | Line 226: | ||
! Activity Type !! Content !! Is Graded? |
! Activity Type !! Content !! Is Graded? |
||
|- |
|- |
||
− | | Question || |
+ | | Question || Features of SparkML || 1 |
− | |- |
||
− | | Question || What are cyclic coordinates? || 1 |
||
|- |
|- |
||
− | | Question || |
+ | | Question || Features of GraphX || 1 |
|- |
|- |
||
+ | | Question || Write a program using SparkML || 0 |
||
− | | Question || Derive the differential equations of a mechanical system based on the General Equation of dynamics || 1 |
||
|- |
|- |
||
− | | Question || |
+ | | Question || Write a program using GraphX || 0 |
− | |- |
||
− | | Question || Apply the principle of virtual work to study the laws of motion of a mechanical system with one degree of freedom. || 0 |
||
− | |- |
||
− | | Question || Apply the General Equation of dynamics to study the laws of motion of a mechanical system with one degree of freedom. || 0 |
||
− | |- |
||
− | | Question || Apply the Lagrange’s equations to study the laws of motion of a mechanical system with several degrees of freedom. || 0 |
||
− | |- |
||
− | | Question || Apply the Lagrange’s equations to study the oscillations of a mechanical system with two degrees of freedom. || 0 |
||
|} |
|} |
||
=== Final assessment === |
=== Final assessment === |
||
'''Section 1''' |
'''Section 1''' |
||
+ | # Design the structure of a cloud architecture for a specific analytics type |
||
− | # Describe the vector, coordinate, and natural methods of specifying particle motion. Show the transition from one method to another. Find the velocity and acceleration of the particle in various methods. |
||
+ | # Give examples of the 6 Vs in real systems |
||
− | # Define the angular velocity vector and the angular acceleration vector of the body. Prove the independence of these vectors from the choice of the pole. Use the Euler vector formula to find the velocities and accelerations of points of a rotating rigid body and a rigid body making plane motion. |
||
− | # Describe ways to set the orientation of a solid in space, including Euler angles, Tight-Brian angles, quaternions. Show the methodology for determining the angular velocity vector in these cases. |
||
− | # Show the methodology of kinematic analysis of planar mechanisms, including the method of composing the equations of motion for the points of the mechanism, the theorems on the velocities and accelerations of body points in plane motion, and the instantaneous center of velocity method. |
||
− | # Show the methodology of kinematic analysis of the complex motion of a particle, the theorems on the addition of velocities and accelerations for complex motion of a particle. |
||
'''Section 2''' |
'''Section 2''' |
||
+ | # Configure a HDFS cluster with some specific replication approaches |
||
− | # Explain the basic axioms of statics. |
||
+ | # Build a HDFS client |
||
− | # Demonstrate the methods for determining the moment of force about a point and about an axis. |
||
+ | # Evaluate the performance of a specific configuration |
||
− | # Demonstrate the methods for transformation a couple of forces. |
||
+ | # Compare the different schedules |
||
− | # Demonstrate the method for determining the principal vector and the principal moment of the force system. |
||
− | # Describe the method for transformation a force system to the simplest possible form. |
||
'''Section 3''' |
'''Section 3''' |
||
+ | # Describe the advantages and disadvantages of the MapReduce model |
||
− | # Formulate the theorem of the motion of the center of mass of a system. Show for what problems this theorem is effective. |
||
+ | # Solve a task designing the solution using MapReduce |
||
− | # Formulate the D’Alambert’s principle. Show for what problems the calculation method based on this principle is effective. |
||
+ | # Solve a task designing the solution using a composition of usage patterns |
||
− | # Describe the concept of force field. Show the method for determining the work of a force at a movement of a particle in a potential force field. |
||
− | # Describe the processes that occur upon impact and methods for calculating the law of motion of the body upon impact. |
||
'''Section 4''' |
'''Section 4''' |
||
+ | # Identify problems and solutions related to the CAP theorem |
||
− | # Formulate the the principle of virtual work. Show for what problems the calculation method based on this principle is effective. |
||
+ | # Compare solutions with batch and stream processing approaches |
||
− | # Formulate the the D’Alembert-Lagrange’s principle. Show for problems tasks the calculation method based on this principle is effective. |
||
+ | # Design a system using a NoSQL database |
||
− | # Demonstrate methods for calculating generalized forces. |
||
+ | '''Section 5''' |
||
− | # Show the different forms of writing the Lagrange equations and explain in which cases each of these forms will be more convenient. |
||
+ | # Extend the SparkML library with a custom algorithm |
||
− | # Demonstrate the principles of choosing the most convenient method for solving a given specific problem of mechanics. |
||
+ | # Extend the GraphX library with a custom algorithm |
||
=== The retake exam === |
=== The retake exam === |
||
Line 294: | Line 263: | ||
'''Section 4''' |
'''Section 4''' |
||
+ | |||
+ | '''Section 5''' |
Revision as of 13:23, 10 November 2022
Big Data Technologies and Analytics
- Course name: Big Data Technologies and Analytics
- Code discipline: N/A
- Subject area: P.1 Short Description
Short Description
This course covers the following concepts: Advanced distributed data organization; Advanced distributed data processing.
Prerequisites
Prerequisite subjects
Prerequisite topics
Course Topics
Section | Topics within the section |
---|---|
Introduction |
|
File systems and resource managers |
|
Batch Processing |
|
Stream Processing |
|
Analytics |
|
Intended Learning Outcomes (ILOs)
What is the main purpose of this course?
Nowadays companies need to manage vast amounts of data on a daily basis. Storing, sorting, accessing and analyzing obtaining synthetic information is considered one of the great challenges of the 21st century and and being effective in this may make the difference between success and failure. In order to gain a competitive advantage, Big Data and Analytics professionals are able to extract useful information from data and increase the Return Of Investments. In this course, students will be exposed to the key technologies and techniques, including R and Apache Spark, in order to analyze large-scale data sets and uncover valuable business information.
ILOs defined at three levels
Level 1: What concepts should a student know/remember/explain?
By the end of the course, the students should be able to ...
- Understanding of big data applications.
- Algorithms for the statistical analysis of big data
- Fundamental principles of predictive analytics
Level 2: What basic practical skills should a student be able to perform?
By the end of the course, the students should be able to ...
- How to process batch data
- How to process stream data
- Advanced design of distributed architectures
- Advanced design of distributed algorithms
Level 3: What complex comprehensive skills should a student be able to apply in real-life scenarios?
By the end of the course, the students should be able to ...
- Write a program for batch processing
- Write a program for stream processing
- Design distributed processing pipelines
- Desing distributed algorithms
Grading
Course grading range
Grade | Range | Description of performance |
---|---|---|
A. Excellent | 90-100 | - |
B. Good | 75-89 | - |
C. Satisfactory | 60-74 | - |
D. Poor | 0-59 | - |
Course activities and grading breakdown
Activity Type | Percentage of the overall course grade |
---|---|
Labs/seminar classes | 30 |
Interim performance assessment | 30 |
Exams | 40 |
Recommendations for students on how to succeed in the course
Resources, literature and reference materials
Open access resources
- Slides and material provided during the course.
- F. Provost and T. Fawcett. Data Science for Business. O’Reilly, 2013
- Matthew North. Data Mining for the Masses, Second Edition: with implementations in RapidMiner and R. CreateSpace Independent Publishing Platform, 2012
- Tom White. Hadoop: The Definitive Guide. O’Reilly Media, Inc., 2012
- Seema Acharya and Subhashini Chellappan. Big data and analytics. WileyIndia, 2016
Closed access resources
Software and tools used within the course
Teaching Methodology: Methods, techniques, & activities
Activities and Teaching Methods
Learning Activities | Section 1 | Section 2 | Section 3 | Section 4 | Section 5 |
---|---|---|---|---|---|
Testing (written or computer based) | 1 | 1 | 1 | 1 | 1 |
Discussions | 1 | 1 | 1 | 1 | 1 |
Development of individual parts of software product code | 0 | 1 | 1 | 1 | 1 |
Homework and group projects | 0 | 1 | 1 | 1 | 1 |
Midterm evaluation | 0 | 1 | 1 | 1 | 1 |
Formative Assessment and Course Activities
Ongoing performance assessment
Section 1
Activity Type | Content | Is Graded? |
---|---|---|
Question | Describe the 6 Vs | 1 |
Question | Describe the technologies to support big data | 1 |
Question | Design the structure of a cloud architecture for big data | 0 |
Question | Give examples of the 6 Vs in real systems | 0 |
Section 2
Activity Type | Content | Is Graded? |
---|---|---|
Question | Describe the characteristics of the different nodes of HDFS | 1 |
Question | How files and blocks are managed | 1 |
Question | Describe the resource manager | 1 |
Question | Describe the lifecycle of an application | 1 |
Question | Describe and compare the scheduling approaches | 1 |
Question | Configure a HDFS cluster | 0 |
Question | Build a HDFS client | 0 |
Question | Use a HDFS command line | 0 |
Question | Configure YARN | 0 |
Question | Evaluate the overall performance of YARN | 0 |
Section 3
Activity Type | Content | Is Graded? |
---|---|---|
Question | Describe the MapReduce model | 1 |
Question | Describe tasks management | 1 |
Question | Describe patterns of usage | 1 |
Question | Solve with MapReduce a specific problem | 0 |
Question | Implement a usage pattern | 0 |
Section 4
Activity Type | Content | Is Graded? |
---|---|---|
Question | Analyze the CAP theorem | 1 |
Question | Define the kinds of data storage available | 1 |
Question | Characteristics of stream processing | 1 |
Question | Describe the usage patterns | 1 |
Question | Build a program to solve a problem with stream processing | 0 |
Question | Interact with a NoSQL database | 0 |
Section 5
Activity Type | Content | Is Graded? |
---|---|---|
Question | Features of SparkML | 1 |
Question | Features of GraphX | 1 |
Question | Write a program using SparkML | 0 |
Question | Write a program using GraphX | 0 |
Final assessment
Section 1
- Design the structure of a cloud architecture for a specific analytics type
- Give examples of the 6 Vs in real systems
Section 2
- Configure a HDFS cluster with some specific replication approaches
- Build a HDFS client
- Evaluate the performance of a specific configuration
- Compare the different schedules
Section 3
- Describe the advantages and disadvantages of the MapReduce model
- Solve a task designing the solution using MapReduce
- Solve a task designing the solution using a composition of usage patterns
Section 4
- Identify problems and solutions related to the CAP theorem
- Compare solutions with batch and stream processing approaches
- Design a system using a NoSQL database
Section 5
- Extend the SparkML library with a custom algorithm
- Extend the GraphX library with a custom algorithm
The retake exam
Section 1
Section 2
Section 3
Section 4
Section 5