High Performance Spark: Best Practices for Scaling and Optimizing Apache Spark

Item Information

Title:	High Performance Spark: Best Practices for Scaling and Optimizing Apache Spark
Authors:	Holden Karau
Keywords:	Apache Spark | Parallel processing (Electronic computers) | Big data | Distributed computing | Distributed systems
Issue Date:	2023
Publisher:	O'Reilly Media
Abstract:	With this book, you'll learn how to: Accelerate your ML workflows with integrations including PyTorch. Handle key skew and take advantage of Spark's new dynamic partitioning. Make your code reliable with scalable testing and validation techniques. Make Spark high performance. Deploy Spark on Kubernetes and similar environments. Take advantage of GPU acceleration with RAPIDS and resource profiles. Get your Spark jobs to run faster. Use Spark to productionize exploratory data science projects. Handle even larger datasets with Spark. Gain faster insights by reducing pipeline running times. Become an O’Reilly member and get unlimited access.
URI:	http://thuvienso.thanglong.edu.vn//handle/TLU/13670
Appears in Collections:	Computer Science - Mathematics

Files in This Item:

Introduction