Thông tin tài liệu
| Nhan đề : | High Performance Spark: Best Practices for Scaling and Optimizing Apache Spark |
| Tác giả : | Holden Karau |
| Chủ đề : | Apache Spark | Parallel processing (Electronic computers) | Big data | Distributed computing | Hệ thống phân tán |
| Năm xuất bản : | 2023 |
| Nhà xuất bản : | O'Reilly Media |
| Tóm tắt : | With this book, you'll learn how to: Accelerate your ML workflows with integrations including PyTorch. Handle key skew and take advantage of Spark's new dynamic partitioning. Make your code reliable with scalable testing and validation techniques. Make Spark high performance. Deploy Spark on Kubernetes and similar environments.Take advantage of GPU acceleration with RAPIDS and resource profiles. Get your Spark jobs to run faster. Use Spark to productionize exploratory data science projects. Handle even larger datasets with Spark. Gain faster insights by reducing pipeline running times. Become an O’Reilly member and get unlimited acces. |
| URI: | http://thuvienso.thanglong.edu.vn//handle/TLU/13670 |
| Bộ sưu tập | Khoa học máy tính - Toán |
XEM MÔ TẢ
1
XEM & TẢI
0
Danh sách tệp tin đính kèm:
