Item Infomation


Title: High Performance Spark: Best Practices for Scaling and Optimizing Apache Spark
Authors: Holden Karau
Keywords: Apache Spark | Parallel processing (Electronic computers) | Big data | Distributed computing | Hệ thống phân tán
Issue Date: 2023
Publisher: O'Reilly Media
Abstract: With this book, you'll learn how to: Accelerate your ML workflows with integrations including PyTorch. Handle key skew and take advantage of Spark's new dynamic partitioning. Make your code reliable with scalable testing and validation techniques. Make Spark high performance. Deploy Spark on Kubernetes and similar environments.Take advantage of GPU acceleration with RAPIDS and resource profiles. Get your Spark jobs to run faster. Use Spark to productionize exploratory data science projects. Handle even larger datasets with Spark. Gain faster insights by reducing pipeline running times. Become an O’Reilly member and get unlimited acces.
URI: http://thuvienso.thanglong.edu.vn//handle/TLU/13670
Appears in CollectionsKhoa học máy tính - Toán
ABSTRACTS VIEWS

1

VIEWS & DOWNLOAD

0

Files in This Item:
Thumbnail
  • TVS.008565_High Performance Spark (for Duc Ka) - Holden Karau, Anya Bida, Adi Polak & Rachel Warren - 2023.pdf
      Restricted Access
  • Đăng nhập để đọc nội dung file
    • Size : 1,13 MB

    • Format : Adobe PDF