DuckDB has quickly become one of the most practical tools for modern data analytics. Lightweight, fast, and simple to set up, it allows you to query massive datasets directly from files without the need for servers or expensive warehouses. Whether you are working with CSV, Parquet, or JSON, DuckDB makes analytics more efficient and more accessible.
This book is a complete guide for analysts, data engineers, and developers who want to get the most out of DuckDB. It takes you step by step from installation and SQL fundamentals to advanced performance tuning and real-world projects. Instead of theory, it focuses on workflows you can apply immediately in Python, R, or directly from the command line.
Inside you will learn:
• How to install and use DuckDB across Windows, macOS, Linux, Python, and R
• How to query raw CSV, Parquet, and JSON files directly without staging data
• Techniques for building ETL pipelines with DuckDB as the transformation layer
• Methods for joining and consolidating datasets across multiple file formats
• Integration with Pandas, Polars, and R dataframes for analytics and data science
• Preparing training datasets for machine learning with Scikit-Learn and PyTorch
• Performance optimizations including vectorization, caching, and parallel execution
• Using DuckDB with BI tools, dashboards, and embedded applications
• Strategies for cloud integration with S3, GCS, and Azure
• Hands-on projects covering sales analytics, ETL automation, predictive modeling, benchmarking, and full pipeline design
Every chapter combines clear explanations with code examples and end-of-section projects so you can reinforce your learning with practical exercises. SQL examples are paired with Python and R code, giving you the flexibility to adapt the workflows to your own environment.
You will see how DuckDB compares to SQLite, Pandas, PostgreSQL, and cloud warehouses, and where it fits best in the modern data stack. You will also explore advanced topics like time-series optimization, partitioned data strategies, community extensions, and future ecosystem developments with MotherDuck and cloud services.
Whether you are a beginner learning SQL, a data scientist preparing machine learning datasets, or a developer embedding analytics into applications, this book gives you the knowledge and confidence to put DuckDB to work effectively.
Analytics should be fast, accessible, and free of unnecessary infrastructure. DuckDB makes this possible, and this guide shows you how to master it with practical, real-world workflows.
The information in the "Summary" section may refer to different editions of this title.
From: GreatBookPrices, Columbia, MD, U.S.A.
Condition: New. Item number 51519598-n
Quantity: More than 20 available
From: Grand Eagle Retail, Bensenville, IL, U.S.A.
Paperback. Condition: New. This item is printed on demand. Shipping may be from multiple locations in the US or from the UK, depending on stock availability. Item number 9798265720399
Quantity: 1 available
From: GreatBookPrices, Columbia, MD, U.S.A.
Condition: As New. Unread book in perfect condition. Item number 51519598
Quantity: More than 20 available
From: PBShop.store UK, Fairford, GLOS, United Kingdom
PAP. Condition: New. New Book. Delivered from our UK warehouse in 4 to 14 business days. THIS BOOK IS PRINTED ON DEMAND. Established seller since 2000. Item number L0-9798265720399
Quantity: More than 20 available
From: GreatBookPricesUK, Woodford Green, United Kingdom
Condition: New. Item number 51519598-n
Quantity: More than 20 available
From: GreatBookPricesUK, Woodford Green, United Kingdom
Condition: As New. Unread book in perfect condition. Item number 51519598
Quantity: More than 20 available
From: CitiRetail, Stevenage, United Kingdom
Paperback. Condition: New. This item is printed on demand. Shipping may be from our UK warehouse or from our Australian or US warehouses, depending on stock availability. Item number 9798265720399
Quantity: 1 available