Learning Spark: Lightning-Fast Data Analytics ペーパーバック – 2020/8/11
Jules S. Damji is a senior developer advocate at Databricks and an MLflow contributor. He is a hands-on developer with over 20 years of experience and has worked as a software engineer at leading companies such as Sun Microsystems, Netscape, @Home, Loudcloud/Opsware, Verisign, ProQuest, and Hortonworks, building large scale distributed systems. He holds a B.Sc. and an M.Sc. in computer science and an MA in political advocacy and communication from Oregon State University, Cal State, and Johns Hopkins University, respectively.
Brooke Wenig is a machine learning practice lead at Databricks. She leads a team of data scientists who develop large-scale machine learning pipelines for customers, as well as teaching courses on distributed machine learning best practices. Previously, she was a principal data science consultant at Databricks. She holds an M.S. in computer science from UCLA with a focus on distributed machine learning.
Tathagata Das is a staff software engineer at Databricks, an Apache Spark committer, and a member of the Apache Spark Project Management Committee (PMC). He is one of the original developers of Apache Spark, the lead developer of Spark Streaming (DStreams), and is currently one of the core developers of Structured Streaming and Delta Lake. Tathagata holds an M.S. in computer science from UC Berkeley.
Denny Lee is a staff developer advocate at Databricks who has been working with Apache Spark since 0.6. He is a hands-on distributed systems and data sciences engineer with extensive experience developing internet-scale infrastructure, data platforms, and predictive analytics systems for both on-premises and cloud environments. He also has an M.S. in biomedical informatics from Oregon Health and Sciences University and has architected and implemented powerful data solutions for enterprise healthcare customers.
- 出版社 : Oreilly & Associates Inc; 第2版 (2020/8/11)
- 発売日 : 2020/8/11
- 言語 : 英語
- ペーパーバック : 373ページ
- ISBN-10 : 1492050040
- ISBN-13 : 978-1492050049
- 寸法 : 17.78 x 2.29 x 23.37 cm
- Amazon 売れ筋ランキング: - 133,075位洋書 (の売れ筋ランキングを見る洋書)
I have the kindle edition and noticed that the formulas on one of the pages on machine learning was slightly cutoff at the edges but I wont remove a star because of that. In my view there are tons of material online to understand those regression formulas. What really worked for me is how great a job the authors have done in explaining how to use Spark 3.0.
Since I am a Python and SQL user this book really benefits me at work. The syntax and function explains are very clear and with an online Databricks account one can really practice as you learn with an uncomplicated dataset. How to program the Dataframe API is really well covered.