Item Information
| Title: | High Performance Spark: Best Practices for Scaling and Optimizing Apache Spark |
| Authors: | Holden Karau |
| Keywords: | Apache Spark | Parallel processing (Electronic computers) | Big data | Distributed computing | Distributed systems |
| Issue Date: | 2023 |
| Publisher: | O'Reilly Media |
| Abstract: | With this book, you'll learn how to: Accelerate your ML workflows with integrations including PyTorch. Handle key skew and take advantage of Spark's new dynamic partitioning. Make your code reliable with scalable testing and validation techniques. Make Spark high performance. Deploy Spark on Kubernetes and similar environments. Take advantage of GPU acceleration with RAPIDS and resource profiles. Get your Spark jobs to run faster. Use Spark to productionize exploratory data science projects. Handle even larger datasets with Spark. Gain faster insights by reducing pipeline running times. |
| URI: | http://thuvienso.thanglong.edu.vn//handle/TLU/13670 |
| Appears in Collections: | Computer Science - Mathematics |
