Course catalog: Hadoop For Administrators Training
Course outline:

Introduction
Hadoop history, concepts
Ecosystem
Distributions
High level architecture
Hadoop myths
Hadoop challenges (hardware / software)
Labs: discuss your Big Data projects and problems
Planning and installation
Selecting software, Hadoop distributions
Sizing the cluster, planning for growth
Selecting hardware and network
Rack topology
Installation
Multi-tenancy
Directory structure, logs
Benchmarking
Labs: cluster install, run performance benchmarks
HDFS operations
Concepts (horizontal scaling, replication, data locality, rack awareness)
Nodes and daemons (NameNode, Secondary NameNode, HA Standby NameNode, DataNode)
Health monitoring
Command-line and browser-based administration
Adding storage, replacing defective drives
Labs: getting familiar with HDFS command lines
Data ingestion
Flume for logs and other data ingestion into HDFS
Sqoop for importing from SQL databases to HDFS, as well as exporting back to SQL
Hadoop data warehousing with Hive
Copying data between clusters (distcp)
Using S3 as a complement to HDFS
Data ingestion best practices and architectures
Labs: setting up and using Flume and Sqoop
MapReduce operations and administration
Parallel computing before MapReduce: compare HPC vs Hadoop administration
MapReduce cluster loads
Nodes and daemons (JobTracker, TaskTracker)
MapReduce UI walkthrough
MapReduce configuration
Job config
Optimizing MapReduce
Fool-proofing MR: what to tell your programmers
Labs: running MapReduce examples
YARN: new architecture and new capabilities
YARN design goals and implementation architecture
New actors: ResourceManager, NodeManager, Application Master
Installing YARN
Job scheduling under YARN
Labs: investigate job scheduling
Advanced topics
Hardware monitoring
Cluster monitoring
Adding and removing servers, upgrading Hadoop
Backup, recovery and business continuity planning
Oozie job workflows
Hadoop high availability (HA)
Hadoop Federation
Securing your cluster with Kerberos
Labs: set up monitoring
Optional tracks
Cloudera Manager for cluster administration, monitoring, and routine tasks; installation and use. In this track, all exercises and labs are performed within the Cloudera distribution environment (CDH5)
Ambari for cluster administration, monitoring, and routine tasks; installation and use. In this track, all exercises and labs are performed within the Ambari cluster manager and Hortonworks Data Platform (HDP 2.0)
