BIG DATA ANALYTICS USING HADOOP

  • Type: Print
  • Category: Academic
  • Sub Category: Text Book
  • Stream: Computer Science, Information Technology

Big data has become a buzzword for an exciting new set of tools and approaches for modern, data-driven applications that are revolutionising the way the world computes. To the dismay of statisticians, the term is often stretched to cover the application of well-known statistical techniques to huge datasets for predictive purposes. Although big data has become a cliché, contemporary distributed computing techniques are enabling analyses of datasets substantially larger than those previously studied, with astonishing results.

Distributed computing alone, however, does not automatically lead to data science. Data products have emerged as a new economic paradigm, driven by the tremendous growth of datasets generated by the Internet and by the insight that these datasets can power predictive models ("more data is better than better algorithms"). Striking successes in modelling vast heterogeneous datasets, such as Nate Silver's seemingly supernatural ability to forecast the 2008 election using big data techniques, have led to widespread recognition of data science's significance and attracted a diverse group of practitioners to the field.

By offering a framework for distributed data storage and parallel computation, Hadoop has developed from a cluster-computing abstraction into an operating system for big data. Spark has expanded on these concepts, making cluster computing more accessible to data scientists. However, data scientists and analysts who are new to distributed computing may feel that these technologies are designed for programmers rather than analysts. This is because using them requires a fundamental shift in thinking: data must be handled and computed in parallel rather than sequentially.

This book aims to prepare data scientists for that shift in thinking by giving an accessible and straightforward overview of cluster computing and analytics. We'll cover the majority of the concepts, tools, and techniques involved in distributed computing for data analysis, and lay the groundwork for more in-depth exploration of specific topics.

Written for an audience of data scientists, this book aims to close the gap between tools that seem built for programmers and the analysts who need them. It introduces the world of clustered computing and analytics with Hadoop from a data science standpoint. The focus is on common analytics, data warehousing approaches, and higher-order data workflows rather than deployment, operations, or software development.

Buy From
IIP Store ₹ 256
Amazon ₹ 320
Flipkart ₹ 320

Book Title: BIG DATA ANALYTICS USING HADOOP
Author(s): Prof. Kanahaiya Lal Ambashtha
ISBN: 978-1-68576-345-9
Book Language: English
Published Date: December, 2022
Total Pages: 132
Book Size: 7x10 Standard
Paper Quality: 75 GSM Normal Paper
Book Edition: First Edition
