Course Objectives
To provide a comprehensive understanding of shell scripting and Git basics.
To introduce students to the Linux environment and scripting fundamentals, and to cover text and file processing in Linux.
To teach process management and advanced Linux topics, including cron jobs and scheduling.
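As a small taste of the scheduling topic, here is a minimal sketch of the kind of script cron typically runs; the directory, paths, and crontab line are assumptions for illustration, not course material.

```python
#!/usr/bin/env python3
"""Minimal log-cleanup script intended to be run by cron (illustrative sketch)."""
import os
import time

LOG_DIR = "/tmp/app-logs"          # hypothetical directory; adjust to your environment
MAX_AGE_SECONDS = 7 * 24 * 3600    # delete files older than one week

def cleanup(directory: str, max_age: float) -> None:
    """Remove files in `directory` whose modification time is older than `max_age`."""
    if not os.path.isdir(directory):
        return
    now = time.time()
    for name in os.listdir(directory):
        path = os.path.join(directory, name)
        if os.path.isfile(path) and now - os.path.getmtime(path) > max_age:
            os.remove(path)

if __name__ == "__main__":
    cleanup(LOG_DIR, MAX_AGE_SECONDS)

# Example crontab entry to run this script daily at 02:00 (assumed paths):
# 0 2 * * * /usr/bin/python3 /home/student/cleanup.py
```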
What You Will Learn
Master Data Processing
- Learn to work with various data processing technologies such as Apache Hadoop, Apache Spark, and Apache Kafka.
- Understand how to design, build, and maintain scalable data pipelines.
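As a hedged illustration of what one stage of such a pipeline can look like, here is a minimal PySpark sketch; it assumes `pyspark` is installed locally, and the input file `events.csv` and its columns are hypothetical.

```python
from pyspark.sql import SparkSession
from pyspark.sql import functions as F

# Start a local Spark session (a real pipeline would run on a cluster).
spark = SparkSession.builder.appName("pipeline-sketch").getOrCreate()

# Extract: read raw events from CSV (hypothetical file and schema).
events = spark.read.csv("events.csv", header=True, inferSchema=True)

# Transform: count events per user, keeping only active users.
active = (
    events.groupBy("user_id")
          .agg(F.count("*").alias("event_count"))
          .filter(F.col("event_count") > 10)
)

# Load: write the aggregated result out as Parquet for downstream consumers.
active.write.mode("overwrite").parquet("active_users.parquet")

spark.stop()
```

The same extract-transform-load shape scales from this local session to a cluster, which is the core idea behind scalable pipeline design.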
Database Management
- Gain proficiency in database management systems (DBMS) including SQL and NoSQL databases.
- Learn to optimize database schemas for performance and scalability.
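For one concrete, minimal example of such an optimization, adding an index to a frequently filtered column, here is a sketch using SQLite from Python's standard library; the table and data are made up.

```python
import sqlite3

conn = sqlite3.connect(":memory:")  # throwaway in-memory database
cur = conn.cursor()

# A simple orders table; customer_id is queried far more often than it is written.
cur.execute("""
    CREATE TABLE orders (
        id          INTEGER PRIMARY KEY,
        customer_id INTEGER NOT NULL,
        amount      REAL    NOT NULL
    )
""")
cur.executemany(
    "INSERT INTO orders (customer_id, amount) VALUES (?, ?)",
    [(1, 9.99), (2, 25.00), (1, 3.50)],
)

# Adding an index on the hot lookup column turns a full-table scan
# into an index search for queries that filter on customer_id.
cur.execute("CREATE INDEX idx_orders_customer ON orders (customer_id)")

cur.execute("SELECT SUM(amount) FROM orders WHERE customer_id = ?", (1,))
print(cur.fetchone()[0])  # 13.49
conn.close()
```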
Data Warehousing
- Explore concepts and tools related to data warehousing, including ETL (Extract, Transform, Load) processes, data modeling, and dimensional modeling.
- Understand how to design and implement data warehouses for analytical purposes.
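To make the ETL steps concrete, here is a toy extract-transform-load pass in plain Python; the file names and fields are assumptions for illustration.

```python
import csv
import sqlite3

# Extract: read raw rows from a source CSV (hypothetical sales export).
def extract(path):
    with open(path, newline="") as f:
        return list(csv.DictReader(f))

# Transform: normalize text fields and derive a revenue column.
def transform(rows):
    for row in rows:
        yield {
            "region": row["region"].strip().upper(),
            "revenue": int(row["units"]) * float(row["unit_price"]),
        }

# Load: write the cleaned rows into a warehouse-style fact table.
def load(rows, conn):
    conn.execute("CREATE TABLE IF NOT EXISTS fact_sales (region TEXT, revenue REAL)")
    conn.executemany(
        "INSERT INTO fact_sales (region, revenue) VALUES (:region, :revenue)", rows
    )
    conn.commit()

if __name__ == "__main__":
    conn = sqlite3.connect("warehouse.db")
    load(transform(extract("sales.csv")), conn)
    conn.close()
```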
Big Data Technologies
- Learn to process and analyze large volumes of data efficiently.
This course includes:
- 40 hours of on-demand video
- Engaging quizzes for every module
- Individual certificates
- Badges
- App accessibility
- Full-time access
- Feedback mechanism
- Downloadable materials
- Resource library
Frequently Asked Questions
Who is this course for?
This course is ideal for aspiring data engineers, software engineers looking to specialize in data, data analysts transitioning into engineering roles, and anyone interested in mastering data infrastructure and processing technologies.
What are the prerequisites?
Basic programming skills (preferably in Python, Java, or a similar language) and familiarity with databases and SQL are recommended. Understanding fundamental data concepts such as data types, data structures, and data manipulation is beneficial but not mandatory.
How is the course structured?
The course is organized into modules covering essential topics such as data processing technologies, database management, data warehousing, big data technologies, cloud data engineering, and data pipeline orchestration. Each module includes lectures, hands-on exercises, and projects to reinforce learning.
What tools and software will I need?
You will need access to a computer with internet connectivity. Specific tools and software such as Apache Spark, databases (e.g., PostgreSQL, MongoDB), and cloud platforms (e.g., AWS, GCP) will be introduced and used throughout the course.
Will I receive a certificate?
Yes. Upon successful completion of the course and its projects, you will receive a certificate of achievement that validates your proficiency in data engineering concepts and practical skills.