BIBLIO is the largest independent book marketplace in the world, with over 100 million books.

Skip to content

Learning Spark: Lightning-Fast Data Analytics
Stock photo: cover may vary

Learning Spark: Lightning-Fast Data Analytics Paperback - 2020

by Jules S. Damji; Brooke Wenig; Tathagata Das

Add to wish list

Reader reviews for Learning Spark: Lightning-Fast Data Analytics

From the publisher

Data is bigger, arrives faster, and comes in a variety of formats and it all needs to be processed at scale for analytics or machine learning. But how can you process such varied workloads efficiently? Enter Apache Spark.

Updated to include Spark 3.0, this second edition shows data engineers and data scientists why structure and unification in Spark matters. Specifically, this book explains how to perform simple and complex data analytics and employ machine learning algorithms. Through step-by-step walk-throughs, code snippets, and notebooks, you ll be able to:

  • Learn Python, SQL, Scala, or Java high-level Structured APIs
  • Understand Spark operations and SQL Engine
  • Inspect, tune, and debug Spark operations with Spark configurations and Spark UI
  • Connect to data sources: JSON, Parquet, CSV, Avro, ORC, Hive, S3, or Kafka
  • Perform analytics on batch and streaming data using Structured Streaming
  • Build reliable data pipelines with open source Delta Lake and Spark
  • Develop machine learning pipelines with MLlib and productionize models using MLflow

Details

  • Title Learning Spark: Lightning-Fast Data Analytics
  • Author Jules S. Damji; Brooke Wenig; Tathagata Das
  • Binding Paperback
  • Pages 397
  • Volumes 1
  • Language ENG
  • Publisher O'Reilly Media
  • Publication date 2020-08-25
  • Illustrated Yes
  • Features Illustrated, Index
  • ISBN 9781492050049 / 1492050040
  • Weight 1.4 lbs (0.64 kg)
  • Dimensions 9.2 x 7 x 0.9 in (23.37 x 17.78 x 2.29 cm)
  • Category Computers - General Information
  • Library of Congress subjects Machine learning, Data mining - Computer programs
  • Dewey Decimal Code 006.312

About the author

Jules S. Damji is a senior developer advocate at Databricks and an MLflow contributor. He is a hands-on developer with over 20 years of experience and has worked as a software engineer at leading companies such as Sun Microsystems, Netscape, @Home, Loudcloud/Opsware, Verisign, ProQuest, and Hortonworks, building large scale distributed systems. He holds a B.Sc. and an M.Sc. in computer science and an MA in political advocacy and communication from Oregon State University, Cal State, and Johns Hopkins University, respectively.

Brooke Wenig is a machine learning practice lead at Databricks. She leads a team of data scientists who develop large-scale machine learning pipelines for customers, as well as teaching courses on distributed machine learning best practices. Previously, she was a principal data science consultant at Databricks. She holds an M.S. in computer science from UCLA with a focus on distributed machine learning.

Tathagata Das is a staff software engineer at Databricks, an Apache Spark committer, and a member of the Apache Spark Project Management Committee (PMC). He is one of the original developers of Apache Spark, the lead developer of Spark Streaming (DStreams), and is currently one of the core developers of Structured Streaming and Delta Lake. Tathagata holds an M.S. in computer science from UC Berkeley.

Denny Lee is a staff developer advocate at Databricks who has been working with Apache Spark since 0.6. He is a hands-on distributed systems and data sciences engineer with extensive experience developing internet-scale infrastructure, data platforms, and predictive analytics systems for both on-premises and cloud environments. He also has an M.S. in biomedical informatics from Oregon Health and Sciences University and has architected and implemented powerful data solutions for enterprise healthcare customers.

More Copies for Sale

Learning Spark: Lightning-Fast Data Analytics
Stock photo: cover may vary

Learning Spark: Lightning-Fast Data Analytics

by Damji, Jules S.; Wenig, Brooke; Das, Tathagata; Lee, Denny

  • Used
  • Very good
Condition
Very good
Edition
2
ISBN 10 / ISBN 13
9781492050049 / 1492050040
Quantity available
1
Seller
Item price
A$37.96
Free Delivery to USA

Show details

Description:
O'Reilly Media. 2. Very Good. It's a well-cared-for item that has seen limited use. The item may show minor signs of wear. All the text is legible, with all pages included. It may have slight markings and/or highlighting.
Add to wish list
Item price
A$37.96
Free Delivery to USA
Learning Spark: Lightning-Fast Data Analytics
Stock photo: cover may vary

Learning Spark: Lightning-Fast Data Analytics

by Damji, Jules S.; Wenig, Brooke; Das, Tathagata; Lee, Denny

  • Used
  • Very good
Condition
Very good
Edition
2
ISBN 10 / ISBN 13
9781492050049 / 1492050040
Quantity available
3
Seller
Item price
A$37.96
Free Delivery to USA

Show details

Description:
O'Reilly Media. 2. Very Good. It's a well-cared-for item that has seen limited use. The item may show minor signs of wear. All the text is legible, with all pages included. It may have slight markings and/or highlighting.
Add to wish list
Item price
A$37.96
Free Delivery to USA
Learning Spak

Learning Spak

by Jules Damji

  • New
  • Paperback
Condition
New
Binding
Paperback
ISBN 10 / ISBN 13
9781492050049 / 1492050040
Quantity available
10
Seller
Item price
A$89.47
A$19.00 Delivery to USA

Show details

Description:
Paperback / softback. New. Updated to emphasize new features in Spark 2.4., this second edition shows data engineers and scientists why structure and unification in Spark matters. Specifically, this book explains how to perform simple and complex data analytics and employ machine-learning algorithms.
Add to wish list
Item price
A$89.47
A$19.00 Delivery to USA
Learning Spark: Lightning-Fast Data Analytics
Stock photo: cover may vary

Learning Spark: Lightning-Fast Data Analytics

by Damji, Jules S

  • Used
  • Paperback
Condition
Used
Edition
2
Binding
Paperback
ISBN 10 / ISBN 13
9781492050049 / 1492050040
Quantity available
1
Seller
Item price
A$69.29
Free Delivery to USA

Show details

Description:
O'Reilly Media, 2020-08-25. 2. paperback. Used: Good. 7.00x0.90x9.20. Buy with confidence. Excellent Customer Service & Return policy.
Add to wish list
Item price
A$69.29
Free Delivery to USA
Learning Spark
Stock photo: cover may vary

Learning Spark

by Damji, Jules/ Lee, Denny/ Wenig, Brooke/ Das, Tathagata,

  • Used
Condition
New
ISBN 10 / ISBN 13
9781492050049 / 1492050040
Quantity available
5
Seller
Item price
A$72.10
A$5.66 Delivery to USA

Show details

Description:
like new.
Add to wish list
Item price
A$72.10
A$5.66 Delivery to USA
Learning Spark

Learning Spark

by Denny Lee

  • New
  • Paperback
Condition
New
Binding
Paperback
ISBN 10 / ISBN 13
9781492050049 / 1492050040
Quantity available
4
Seller
Item price
A$100.98
A$15.26 Delivery to USA

Show details

Description:
Paperback / softback. New. New Book; Fast Shipping from UK; Not signed; Not First Edition; Updated to emphasize new features in Spark 2.4., this second edition shows data engineers and scientists why structure and unification in Spark matters. Specifically, this book explains how to perform simple and complex data analytics a
Add to wish list
Item price
A$100.98
A$15.26 Delivery to USA
Learning Spark
Stock photo: cover may vary

Learning Spark

by Damji, Jules/ Lee, Denny/ Wenig, Brooke/ Das, Tathagata,

  • New
Condition
New
ISBN 10 / ISBN 13
9781492050049 / 1492050040
Quantity available
5
Seller
Item price
A$76.02
A$5.66 Delivery to USA

Show details

Description:
new.
Add to wish list
Item price
A$76.02
A$5.66 Delivery to USA
Learning Spark: Lightning-Fast Data Analytics
Stock photo: cover may vary

Learning Spark: Lightning-Fast Data Analytics

by Damji, Jules S

  • New
  • Paperback
Condition
New
Edition
2
Binding
Paperback
ISBN 10 / ISBN 13
9781492050049 / 1492050040
Quantity available
6
Seller
Item price
A$81.70
Free Delivery to USA

Show details

Description:
O'Reilly Media, 2020-08-25. 2. paperback. New. 7.00x0.90x9.20. Buy with confidence. Excellent Customer Service & Return policy.
Add to wish list
Item price
A$81.70
Free Delivery to USA
Learning Spark: Lightning-Fast Data Analytics
Stock photo: cover may vary

Learning Spark: Lightning-Fast Data Analytics

by O'Reilly Media

  • New
Condition
New
ISBN 10 / ISBN 13
9781492050049 / 1492050040
Quantity available
120
Seller
Item price
A$84.09
A$5.66 Delivery to USA

Show details

Description:
O'Reilly Media. New. BRAND NEW, GIFT QUALITY! NOT OVERSTOCKS OR MARKED UP REMAINDERS! DIRECT FROM THE PUBLISHER!
Add to wish list
Item price
A$84.09
A$5.66 Delivery to USA
Learning Spark: Lightning-Fast Data Analytics
Stock photo: cover may vary

Learning Spark: Lightning-Fast Data Analytics

by Damji, Jules S.

  • Used
  • Good
  • Paperback
Condition
Good
Binding
Paperback
ISBN 10 / ISBN 13
9781492050049 / 1492050040
Quantity available
1
Seller
Item price
A$105.72
Free Delivery to USA

Show details

Description:
paperback. Good. Access codes and supplements are not guaranteed with used items. May be an ex-library book.
Add to wish list
Item price
A$105.72
Free Delivery to USA