PROGRAMMING BIG DATA APPLICATIONS by Domenico Talia, Paolo Trunfio, Fabrizio Marozzo, Loris Belcastro, Riccardo Cantini & Alessio Orsino

PROGRAMMING BIG DATA APPLICATIONS

By

Description

In the age of the Internet of Things and social media platforms, huge amounts of digital data are generated by and collected from many sources, including sensors, mobile devices, wearable trackers and security cameras. These data, commonly referred to as big data, are challenging current storage, processing and analysis capabilities. New models, languages, systems and algorithms continue to be developed to effectively collect, store, analyze and learn from big data.

Programming Big Data Applications introduces and discusses models, programming frameworks and algorithms to process and analyze large amounts of data. In particular, the book provides an in-depth description of the properties and mechanisms of the main programming paradigms for big data analysis, including MapReduce, workflow, BSP, message passing, and SQL-like. Through programming examples it also describes the most used frameworks for big data analysis like Hadoop, Spark, MPI, Hive and Storm. Each of the different systems is discussed and compared, highlighting their main features, their diffusion (both within their community of developers and among users), and their main advantages and disadvantages in implementing big data analysis applications.

Contents:
PrefaceAbout the AuthorsAcknowledgmentsList of FiguresList of TablesIntroduction:Motivation and GoalsMain TopicsAudience and OrganizationOnline ResourcesBig Data Concepts:Big Data Principles and FeaturesData Science ConceptsBig Data StorageScalable Data AnalysisParallel ComputingCloud ComputingToward Exascale ComputingParallel and Distributed Machine LearningProgramming Models for Big Data:Parallel Programming for Big Data ApplicationsThe MapReduce ModelThe Workflow ModelThe Message-Passing ModelThe BSP ModelThe SQL-Like ModelThe PGAS ModelModels for Exascale SystemsTools for Big Data applications:IntroductionMapReduce-based Programming ToolsWorkflow-based Programming ToolsMessage Passing-based Programming ToolsBSP-based Programming ToolsSQL-like Programming ToolsPGAS-based Programming ToolsComparing Programming Tools:IntroductionComparative Analysis of the System FeaturesComparative Analysis through Application ExamplesChoosing the Right Framework to Tame Big Data:The Input DataThe Application ClassThe InfrastructureOther FactorsSupplementary MaterialBibliographyIndex
Readership: Undergraduate and graduate students in computer science, computer engineering, data science, and data engineering. PhD students and researchers in computer science and engineering, and data science.

Key Features: Helps designers and developers in programming Big Data applications by identifying and selecting the best/appropriate programming tool based on their skills, hardware availability, application domains and purposes, and also considering the support provided by the developer community Presents real programming examples for each programming language/framework to show how Big Data applications can be developed

Preview

More Domenico Talia, Paolo Trunfio, Fabrizio Marozzo, Loris Belcastro, Riccardo Cantini & Alessio Orsino Books