Data Analysis With Python and PySpark - Softcover

Rioux, Jonathan

9781617297205: Data Analysis With Python and PySpark

Softcover

ISBN 10: 1617297208 ISBN 13: 9781617297205

Publisher: Manning Pubns Co, 2022

View all copies of this ISBN edition

8 Used

From: EUR 27,54

25 New

From: EUR 56,72

Think big about your data! PySpark brings the powerful Spark big data processing engine to the Python ecosystem, letting you seamlessly scale up your data tasks and create lightning-fast pipelines.

In Data Analysis with Python and PySpark you will learn how to:

    Manage your data as it scales across multiple machines
    Scale up your data programs with full confidence
    Read and write data to and from a variety of sources and formats
    Deal with messy data with PySpark’s data manipulation functionality
    Discover new data sets and perform exploratory data analysis
    Build automated data pipelines that transform, summarize, and get insights from data
    Troubleshoot common PySpark errors
    Create reliable long-running jobs

Data Analysis with Python and PySpark is your guide to delivering successful Python-driven data projects. Packed with relevant examples and essential techniques, this practical book teaches you to build pipelines for reporting, machine learning, and other data-centric tasks. Quick exercises in every chapter help you practice what you’ve learned and rapidly start integrating PySpark into your data systems. No previous knowledge of Spark is required.

Purchase of the print book includes a free eBook in PDF, Kindle, and ePub formats from Manning Publications.

About the technology

The Spark data processing engine is an amazing analytics factory: raw data comes in, insight comes out. PySpark wraps Spark’s core engine with a Python-based API. It helps simplify Spark’s steep learning curve and makes this powerful tool available to anyone working in the Python data ecosystem.

About the book

Data Analysis with Python and PySpark helps you solve the daily challenges of data science with PySpark. You’ll learn how to scale your processing capabilities across multiple machines while ingesting data from any source, whether that’s Hadoop clusters, cloud data storage, or local data files. Once you’ve covered the fundamentals, you’ll explore the full versatility of PySpark by building machine learning pipelines and blending Python, pandas, and PySpark code.

What's inside

    Organizing your PySpark code
    Managing your data, no matter the size
    Scaling up your data programs with full confidence
    Troubleshooting common data pipeline problems
    Creating reliable long-running jobs

About the reader

Written for data scientists and data engineers comfortable with Python.

About the author

As an ML director for a data-driven software company, Jonathan Rioux uses PySpark daily. He teaches the software to data scientists, engineers, and data-savvy business analysts.

Table of Contents

1 Introduction

PART 1 GET ACQUAINTED: FIRST STEPS IN PYSPARK
2 Your first data program in PySpark
3 Submitting and scaling your first PySpark program
4 Analyzing tabular data with pyspark.sql
5 Data frame gymnastics: Joining and grouping

PART 2 GET PROFICIENT: TRANSLATE YOUR IDEAS INTO CODE
6 Multidimensional data frames: Using PySpark with JSON data
7 Bilingual PySpark: Blending Python and SQL code
8 Extending PySpark with Python: RDD and UDFs
9 Big data is just a lot of small data: Using pandas UDFs
10 Your data under a different lens: Window functions
11 Faster PySpark: Understanding Spark’s query planning

PART 3 GET CONFIDENT: USING MACHINE LEARNING WITH PYSPARK
12 Setting the stage: Preparing features for machine learning
13 Robust machine learning with ML Pipelines
14 Building custom ML transformers and estimators

The information in the "Synopsis" section may refer to different editions of this title.

About the author

As a data scientist for an engineering consultancy, Jonathan Rioux uses PySpark daily. He teaches the software to data scientists, engineers, and data-savvy business analysts.

The information in the "About this book" section may refer to different editions of this title.

Publisher: Manning Pubns Co
Publication date: 2022
Language: English
ISBN 10: 1617297208
ISBN 13: 9781617297205
Binding: Softcover
Number of pages: 434
Manufacturer contact: not available
Responsible person: not available

Buy used

Condition: very good

May have limited writing in cover...

EUR 27,54

Free shipping
Ships within U.S.A.

Buy new

EUR 56,72

Shipping: EUR 2,32
Ships within U.S.A.

Search results for Data Analysis With Python and PySpark

Publisher's photo

Data Analysis with Python and PySpark

Rioux, Jonathan

Publisher: Manning Publications, 2022

ISBN 10: 1617297208 ISBN 13: 9781617297205

Used Paperback

From: ThriftBooks-Atlanta, AUSTELL, GA, U.S.A.

Seller rating: 5 out of 5 stars

Paperback. Condition: Very Good. No Jacket. May have limited writing in cover pages. Pages are unmarked. ~ ThriftBooks: Read More, Spend Less. Item code G1617297208I4N00

Buy used

EUR 27,54

Free shipping
Ships within U.S.A.

Quantity: 1 available

Publisher's photo

Data Analysis with Python and PySpark

Rioux, Jonathan

Publisher: Manning Publications, 2022

ISBN 10: 1617297208 ISBN 13: 9781617297205

Used Softcover

From: More Than Words, Waltham, MA, U.S.A.

Seller rating: 5 out of 5 stars

Condition: Good. A sound copy with only light wear. Overall a solid copy at a great price! Item code BOS-K-05g-01804

Buy used

EUR 26,52

Shipping: EUR 3,51
Ships within U.S.A.

Quantity: 1 available

Images supplied by the seller

Data Analysis with Python and PySpark

Rioux, Jonathan

Publisher: Manning Publications, 2022

ISBN 10: 1617297208 ISBN 13: 9781617297205

Used Softcover

From: -OnTimeBooks-, Phoenix, AZ, U.S.A.

Seller rating: 5 out of 5 stars

Condition: Very good. Gently read. May have name of previous ownership, or ex-library edition. Binding tight; spine straight and smooth, with no creasing; covers clean and crisp. Minimal signs of handling or shelving. 100% GUARANTEE! Shipped with delivery confirmation; if you're not satisfied with your purchase, please return the item! Ships USPS Media Mail. Item code OTV.1617297208.VG

Buy used

EUR 37,92

Free shipping
Ships within U.S.A.

Quantity: 1 available

Images supplied by the seller

Data Analysis with Python and PySpark

Rioux, Jonathan

Publisher: Manning Publications, 2022

ISBN 10: 1617297208 ISBN 13: 9781617297205

Used Softcover

From: Goodbooks Company, Springdale, AR, U.S.A.

Seller rating: 5 out of 5 stars

Condition: Acceptable. This copy has liquid damage. Item code GBV.1617297208.A

Buy used

EUR 35,37

Shipping: EUR 4,39
Ships within U.S.A.

Quantity: 1 available

Publisher's photo

Data Analysis with Python and PySpark

Rioux, Jonathan

Publisher: Manning Publications, 2022

ISBN 10: 1617297208 ISBN 13: 9781617297205

Used Softcover

From: medimops, Berlin, Germany

Seller rating: 5 out of 5 stars

Condition: Good. Satisfactory/Good: an average, worn book or dust jacket with signs of use but with all pages present. Item code M01617297208-G

Buy used

EUR 35,30

Shipping: EUR 10,00
Ships from Germany to U.S.A.

Quantity: 1 available

Publisher's photo

Data Analysis With Python and PySpark

Rioux, Jonathan

Publisher: Manning Publications, 2022

ISBN 10: 1617297208 ISBN 13: 9781617297205

Used Softcover

From: GreatBookPrices, Columbia, MD, U.S.A.

Seller rating: 5 out of 5 stars

Condition: As New. Unread book in perfect condition. Item code 43997875

Buy used

EUR 51,22

Shipping: EUR 2,32
Ships within U.S.A.

Quantity: More than 20 available

Publisher's photo

Data Analysis With Python and PySpark

Rioux, Jonathan

Publisher: Manning Publications, 2022

ISBN 10: 1617297208 ISBN 13: 9781617297205

New Softcover

From: GreatBookPrices, Columbia, MD, U.S.A.

Seller rating: 5 out of 5 stars

Condition: New. Item code 43997875-n

Buy new

EUR 56,72

Shipping: EUR 2,32
Ships within U.S.A.

Quantity: More than 20 available

Images supplied by the seller

Data Analysis with Python and PySpark

Rioux, Jonathan

Publisher: Manning Publications Co., Shelter Island, NY, 2022

ISBN 10: 1617297208 ISBN 13: 9781617297205

Used Paperback

From: PsychoBabel & Skoob Books, Didcot, United Kingdom

Seller rating: 5 out of 5 stars

Paperback. Condition: Very Good. Paperback in very good condition. Cover edges and corners are slightly bumped and rubbed. Covers are clean, binding is sound and content is as unread. LW. Used. Item code 611289

Buy used

EUR 44,48

Shipping: EUR 14,59
Ships from United Kingdom to U.S.A.

Quantity: 1 available

Publisher's photo

Data Analysis with Python and PySpark (Paperback)

Jonathan Rioux

Publisher: Manning Publications, New York, 2022

ISBN 10: 1617297208 ISBN 13: 9781617297205

New Paperback

From: Grand Eagle Retail, Bensenville, IL, U.S.A.

Seller rating: 5 out of 5 stars

Paperback. Condition: New. Paperback. When it comes to data analytics, it pays to think big. PySpark blends the powerful Spark big data processing engine with the Python programming language to provide a data analysis platform that can scale up for nearly any task. Data Analysis with Python and PySpark is your guide to delivering successful Python-driven data projects. Data Analysis with Python and PySpark is a carefully engineered tutorial that helps you use PySpark to deliver your data-driven applications at any scale. This clear and hands-on guide shows you how to enlarge your processing capabilities across multiple machines with data from any source, ranging from Hadoop-based clusters to Excel worksheets. You'll learn how to break down big analysis tasks into manageable chunks and how to choose and use the best PySpark data abstraction for your unique needs. The Spark data processing engine is an amazing analytics factory: raw data comes in, and insight comes out. Thanks to its ability to handle massive amounts of data distributed across a cluster, Spark has been adopted as standard by organizations both big and small. PySpark, which wraps the core Spark engine with a Python-based API, puts Spark-based data pipelines in the hands of programmers and data scientists working with the Python programming language. PySpark simplifies Spark's steep learning curve and provides a seamless bridge between Spark and an ecosystem of Python-based data science tools. Shipping may be from multiple locations in the US or from the UK, depending on stock availability. Item code 9781617297205

Buy new

EUR 59,11

Free shipping
Ships within U.S.A.

Quantity: 1 available

Images supplied by the seller

Data Analysis with Python and PySpark

Jonathan Rioux

Publisher: Manning Publications, US, 2022

ISBN 10: 1617297208 ISBN 13: 9781617297205

New Paperback

From: Rarewaves.com USA, London, LONDO, United Kingdom

Seller rating: 5 out of 5 stars

Paperback. Condition: New. When it comes to data analytics, it pays to think big. PySpark blends the powerful Spark big data processing engine with the Python programming language to provide a data analysis platform that can scale up for nearly any task. Data Analysis with Python and PySpark is your guide to delivering successful Python-driven data projects. Data Analysis with Python and PySpark is a carefully engineered tutorial that helps you use PySpark to deliver your data-driven applications at any scale. This clear and hands-on guide shows you how to enlarge your processing capabilities across multiple machines with data from any source, ranging from Hadoop-based clusters to Excel worksheets. You'll learn how to break down big analysis tasks into manageable chunks and how to choose and use the best PySpark data abstraction for your unique needs. The Spark data processing engine is an amazing analytics factory: raw data comes in, and insight comes out. Thanks to its ability to handle massive amounts of data distributed across a cluster, Spark has been adopted as standard by organizations both big and small. PySpark, which wraps the core Spark engine with a Python-based API, puts Spark-based data pipelines in the hands of programmers and data scientists working with the Python programming language. PySpark simplifies Spark's steep learning curve and provides a seamless bridge between Spark and an ecosystem of Python-based data science tools. Item code LU-9781617297205

Buy new

EUR 59,33

Free shipping
Ships from United Kingdom to U.S.A.

Quantity: 1 available

See 23 more copies of this book
