Hadoop is indispensable when it comes to processing big data, as necessary to understanding your information as servers are to storing it. This course is your introduction to Hadoop; the key file systems used with Hadoop; its processing engine, MapReduce; and its many libraries and programming tools. Developer and big-data consultant Lynn Langit shows how to set up a Hadoop development environment, run and optimize MapReduce jobs, code basic queries with Hive and Pig, and build workflows to schedule jobs. Plus, learn about the depth and breadth of Apache Spark libraries available for use with a Hadoop cluster, as well as options for running machine learning jobs on a Hadoop cluster.