Actuarial ScienceManagementMaster of AccountingMaster of Business AdministrationMaster's in Business AnalyticsMaster's in Corporate FinanceMaster's in Management & Organizational LeadershipMaster's in Real Estate Analysis & DevelopmentPre-Major Business

Apache Spark Essential Training: Big Data Engineering

Instructor: Kumaran Ponnambalam

Data engineering is the foundation for building analytics and data science applications in the new Big Data world. Data engineering requires combining multiple big data technologies to construct data pipelines and networks to stream, process, and store data. This course focuses on building full-fledged solutions that combine Apache Spark with other Big Data tools to create end-to-end data pipelines. Instructor Kumaran Ponnambalam begins by defining data engineering, its functions, and its concepts. Next, Kumaran goes over how Spark capabilities such as parallel processing, execution plans, state management options, and machine learning work with extract, transform, load (ETL). He introduces you to batch processing use cases and processes, as well as real-time processing pipelines. After walking you through several useful best practices, Kumaran concludes with an end-to-end exercise project.

Learn More

Apache Spark Essential Training: Big Data Engineering

Excel for Accounting

The Business of Accounting

Accounting Foundations: Understanding the GAAP (Generally Accepted Accounting Principles)

Accounting Foundations: Global Finance and Accounting

Succeeding as an LGBT Professional

Accounting Foundations: Cost-Based Pricing Strategies

Empowering BIPOC through Mentorship

Excel: Implementing Balanced Scorecards with KPIs

Customer Service: Serving Customers Through Chat and Text

SharePoint Online: Working in the Modern Experience

Using Humor In Training to Engage Your Audience

Get Unstuck: Make a Plan to Move Your Career Forward

Data Dashboards in Power BI

Data Ingestion with Python

SOLIDWORKS: Drawings

Certified Analytics Professional (CAP) Cert Prep: Domains 5–7

Developing a Style Guide

Certified Analytics Professional (CAP) Cert Prep: Domains 1–4

Data Pipeline Automation with GitHub Actions Using R and Python

Photos for macOS Catalina Essential Training