CDH——Cloudera Developer Training for Spark and hadoop

IRENE · 发表于 2016-5-31 16:32:17

Cloudera Developer Training for Spark and hadoop

Course Time：2016年6月27-30日

Course Location：上海市浦东新区张江高科伯克利工程创新中心

Contact us：400-679-6113

QQ：1438118790

Certification：CCA-175

Learn how toimport data into your Apache Hadoop closter and process it with spark、hive、flume、sqoop、impala and other Hadoop ecosystem tools.

Audience and Prerequisites

This coursedesigned for developers and engineers who have programming experience. Apachespark examples and hands-on exercises are presented in Scala and Python, so theability to program in one of those languages is required. Basic familiaritywith the Linux command line is assumed. Basic knowledge of SQL is helpful. Priorknowledge of Hadoop is not required.

Course outline：DeveloperTraining for Spark and hadoop

Ø Introduction to Hadoop and the Hadoop ecosystem

Ø Hadoop architecture and HDFS

Ø Importing relational data with Apache spoop

Ø Introduction to impala and hive

Ø Modeling and managing data with impala and hive

Ø Data formats

Ø Data partitioning

Ø Capturing data with Apache flume

Ø Spark basics

Ø Working with RDDs in spark

Ø Writing and deploying spark applications

Ø Parallel programming with spark

Ø Spark caching and persistence

Ø Common patterns in spark data processing

Ø Preview：spark SQL

收藏本站

快速投稿

联系我们

广告服务

基石导航

峰会活动

基石数据

社区

CDH——Cloudera Developer Training for Spark and hadoop

浏览过的版块