Description: Advanced Analytics with PySpark by Akash Tandon, Sandy Ryza, Uri Laserson, Sean Owen, Josh Wills Estimated delivery 4-14 business days Format Paperback Condition Brand New Description Updated for Spark 3.0, this practical guide brings together Spark, statistical methods, and real-world datasets to teach you how to approach analytics problems using PySpark, Sparks Python API, and other best practices in Spark programming. Publisher Description The amount of data being generated today is staggering--and growing. Apache Spark has emerged as the de facto tool to analyze big data and is now a critical part of the data science toolbox. Updated for Spark 3.0, this practical guide brings together Spark, statistical methods, and real-world datasets to teach you how to approach analytics problems using PySpark, Sparks Python API, and other best practices in Spark programming.Data scientists Akash Tandon, Sandy Ryza, Uri Laserson, Sean Owen, and Josh Wills offer an introduction to the Spark ecosystem, then dive into patterns that apply common techniques--including classification, clustering, collaborative filtering, and anomaly detection--to fields such as genomics, security, and finance. This updated edition also covers NLP and image processing.If you have a basic understanding of machine learning and statistics and you program in Python, this book will get you started with large-scale data analysis.Familiarize yourself with Sparks programming model and ecosystemLearn general approaches in data scienceExamine complete implementations that analyze large public datasetsDiscover which machine learning tools make sense for particular problemsExplore code that can be adapted to many uses Author Biography Akash Tandon is an independent consultant and experienced full-stack data engineer. Previously, he was a senior data engineer at Atlan, where he built software for enterprise data science teams. In another life, he had worked on data science projects for governments, and built risk assessment tools at a FinTech startup. As a student, he wrote open source software with the R project for statistical computing and Google. In his free time, he researches things for no good reason. Sandy Ryza is software engineer at Elementl. Previously, he developed algorithms for public transit at Remix and was a senior data scientist at Cloudera and Clover Health. He is an Apache Spark committer, Apache Hadoop PMC member, and founder of the Time Series for Spark project. Uri Laserson is founder & CTO of Patch Biosciences. Previously, he worked on big data and genomics at Cloudera. Sean Owen is a principal solutions architect focusing on machine learning and data science at Databricks. He is an Apache Spark committer and PMC member, and co-author Advanced Analytics with Spark. Previously, he was director of Data Science at Cloudera and an engineer at Google. Josh Wills is an independent data science and engineering consultant, the former head of data engineering at Slack and data science at Cloudera, and wrote a tweet about data scientists once. Details ISBN 1098103653 ISBN-13 9781098103651 Title Advanced Analytics with PySpark Author Akash Tandon, Sandy Ryza, Uri Laserson, Sean Owen, Josh Wills Format Paperback Year 2022 Pages 275 Publisher OReilly Media GE_Item_ID:135544974; About Us Grand Eagle Retail is the ideal place for all your shopping needs! With fast shipping, low prices, friendly service and over 1,000,000 in stock items - you're bound to find what you want, at a price you'll love! Shipping & Delivery Times Shipping is FREE to any address in USA. Please view eBay estimated delivery times at the top of the listing. Deliveries are made by either USPS or Courier. We are unable to deliver faster than stated. International deliveries will take 1-6 weeks. NOTE: We are unable to offer combined shipping for multiple items purchased. This is because our items are shipped from different locations. Returns If you wish to return an item, please consult our Returns Policy as below: Please contact Customer Services and request "Return Authorisation" before you send your item back to us. Unauthorised returns will not be accepted. Returns must be postmarked within 4 business days of authorisation and must be in resellable condition. Returns are shipped at the customer's risk. We cannot take responsibility for items which are lost or damaged in transit. For purchases where a shipping charge was paid, there will be no refund of the original shipping charge. Additional Questions If you have any questions please feel free to Contact Us. Categories Baby Books Electronics Fashion Games Health & Beauty Home, Garden & Pets Movies Music Sports & Outdoors Toys
Price: 55.64 USD
Location: Calgary, Alberta
End Time: 2024-09-28T06:06:09.000Z
Shipping Cost: 0 USD
Product Images
Item Specifics
Restocking Fee: No
Return shipping will be paid by: Buyer
All returns accepted: Returns Accepted
Item must be returned within: 30 Days
Refund will be given as: Money Back
ISBN-13: 9781098103651
Book Title: Advanced Analytics with PySpark
Number of Pages: 275 Pages
Language: English
Publication Name: Advanced Analytics with Pyspark : Patterns for Learning from Data at Scale Using Python and Spark
Publisher: O'reilly Media, Incorporated
Publication Year: 2022
Subject: Data Processing, Databases / Data Mining, Programming Languages / Python
Item Height: 0.6 in
Item Weight: 14.5 Oz
Type: Textbook
Subject Area: Computers
Item Length: 9.2 in
Author: Akash Tandon, Sean Owen, Uri Laserson, Josh Wills, Sandy Ryza
Item Width: 6.9 in
Format: Trade Paperback