Download e-book for iPad: Apache Spark 2.x for Java Developers by Sourav Gulati,Sumit Kumar

By Sourav Gulati,Sumit Kumar

ISBN-10: 1787126498

ISBN-13: 9781787126497

Key Features

  • Perform gigantic information processing with Spark—without having to profit Scala!
  • Use the Spark Java API to enforce effective enterprise-grade functions for facts processing and analytics
  • Go past mainstream facts processing via including querying potential, laptop studying, and graph processing utilizing Spark

Book Description

Apache Spark is the buzzword within the titanic information instantly, specially with the expanding want for real-time streaming and knowledge processing. whereas Spark is outfitted on Scala, the Spark Java API exposes all of the Spark beneficial properties to be had within the Scala model for Java builders. This publication will exhibit you ways you could enforce numerous functionalities of the Apache Spark framework in Java, with no stepping from your convenience zone.

The e-book begins with an advent to the Apache Spark 2.x atmosphere, by way of explaining the way to set up and configure Spark, and refreshes the Java innovations that may be priceless to you while eating Apache Spark's APIs. you are going to discover RDD and its linked universal motion and Transformation Java APIs, organize a production-like clustered setting, and paintings with Spark SQL. relocating on, you are going to practice near-real-time processing with Spark streaming, desktop studying analytics with Spark MLlib, and graph processing with GraphX, all utilizing quite a few Java packages.

By the tip of the e-book, you have got a superior beginning in enforcing parts within the Spark framework in Java to construct speedy, real-time applications.

What you'll learn

  • Process information utilizing assorted dossier codecs resembling XML, JSON, CSV, and simple and delimited textual content, utilizing the Spark middle Library.
  • Perform analytics on info from quite a few facts resources equivalent to Kafka, and Flume utilizing Spark Streaming Library
  • Learn SQL schema construction and the research of established facts utilizing a variety of SQL features together with Windowing capabilities within the Spark SQL Library
  • Explore Spark Mlib APIs whereas enforcing computer studying recommendations to resolve real-world problems
  • Get to grasp Spark GraphX so that you comprehend a variety of graph-based analytics that may be played with Spark

About the Author

Sourav Gulati is linked to software program for greater than 7 years. He all started his occupation with Unix/Linux and Java after which moved in the direction of immense info and NoSQL international. He has labored on a number of mammoth information tasks. He has lately begun a technical web publication referred to as Technical studying to boot. except IT global, he likes to examine mythology.

Sumit Kumar is a developer with insights in telecom and banking. At various junctures, he has labored as a Java and SQL developer, however it is shell scripting that he unearths either hard and gratifying even as. at the moment, he provides vast info initiatives interested in batch/near-real-time analytics and the disbursed listed querying approach. in addition to IT, he's taking a prepared curiosity in human and ecological issues.

Table of Contents

  1. Introduction to Spark
  2. Java for Spark
  3. Let's Spark
  4. Understanding Spark Programming model
  5. Working with info & storage
  6. Spark on Cluster
  7. Spark Programming version - increase concepts
  8. Working with Spark SQL
  9. Near genuine time processing with Spark Streaming
  10. Machine studying analytics with Spark MLlib
  11. Learning Spark GraphX

Show description

Read or Download Apache Spark 2.x for Java Developers PDF

Best data modeling & design books

Vladimir Cherkassky,Filip M. Mulier's Learning from Data: Concepts, Theory, and Methods (Adaptive PDF

An interdisciplinary framework for studying methodologies-covering statistics, neural networks, and fuzzy common sense This publication offers a unified remedy of the rules and strategies for studying dependencies from facts. It establishes a basic conceptual framework within which quite a few studying tools from records, neural networks, and fuzzy good judgment should be applied-showing few primary ideas underlie so much new tools being proposed at the present time in facts, engineering, and laptop technology.

R Quick Syntax Reference - download pdf or read online

The R quickly Syntax Reference is a convenient reference booklet detailing the intricacies of the R language. not just is R a unfastened, open-source device, R is strong, versatile, and has state-of-the-art statistical innovations on hand. With the various information which has to be right while utilizing any language, notwithstanding, the R speedy Syntax Reference makes utilizing R more straightforward.

Read e-book online Machine Learning with R Cookbook - 110 Recipes for Building PDF

Key FeaturesApply R to simplify predictive modeling with brief and easy codeUse computer studying to unravel difficulties starting from small to important dataBuild a coaching and checking out dataset from the churn dataset, employing diversified type methodsBook DescriptionThe R language is a strong open resource sensible programming language.

Read e-book online MDX with Microsoft SQL Server 2016 Analysis Services PDF

Over 70 sensible recipes to research multi-dimensional facts in SQL Server 2016 research companies cubesAbout This BookUpdated for SQL Server 2016, this e-book is helping you are taking benefit of the hot MDX instructions and the recent positive aspects brought in SSASPerform time-related, context-aware, and company related-calculations comfortably to complement your enterprise Intelligence solutionsCollection of recommendations to write down versatile and excessive acting MDX queries in SSAS with rigorously dependent examplesWho This booklet Is ForThis booklet is for a person who has been enthusiastic about operating with multidimensional info.

Additional resources for Apache Spark 2.x for Java Developers

Example text

Download PDF sample

Apache Spark 2.x for Java Developers by Sourav Gulati,Sumit Kumar

by Anthony

Rated 4.09 of 5 – based on 5 votes