Data Munging with Hadoop

Data Munging with Hadoop

Data Munging with Hadoop

The Example-Rich, Hands-On Guide to Data Munging with Apache Hadoop TM Data scientists spend much of their time "munging" data: handling day-to-day tasks such as data cleansing, normalization, aggregation, sampling, and transformation. These tasks are both critical and surprisingly interesting. Most important, they deepen your understanding of your data's structure and limitations: crucial insight for improving accuracy and mitigating risk in any analytical project.

Price
Paid
Platform
InformIT
Categories
eBook

Related Courses

Explore similar courses.

Practical Data Science with Hadoop and Spark: Designing and Building Effective Analytics at Scale

Practical Data Science with Hadoop and Spark: Designing and Building Effective Analytics at Scale

As adoption of Hadoop accelerates in the enterprise and beyond, there's soaring demand for those who can solve real world problems by applying advanced data science techniques in Hadoop environments. Now there's a complete and up-to-date guide to data science with Hadoop: high-level concepts, deep-dive techniques, practical applications, hands-on tutorials, and real-world use cases.

InformIT Learn more
Virtualizing Hadoop: How to Install, Deploy, and Optimize Hadoop in a Virtualized Architecture

Virtualizing Hadoop: How to Install, Deploy, and Optimize Hadoop in a Virtualized Architecture

This is the only complete foundational guide to virtualizing Hadoop and deploying it in the cloud. The authors demystify all aspects of virtualizing Hadoop at scale, empowering DBAs, BI specialists, integrators, architects, and managers to deploy quickly and achieve outstanding performance.¨Hadoop as a Service combines exceptional clarity for Hadoop newcomers with realistic examples for building deep technical skill.

InformIT Learn more

Get the latest news!