This book's got a bunch of handy recipes for data science pros to get them through the most common challenges they face when using Python tools and libraries. Each recipe shows you exactly how to do something step-by-step. You can load CSVs directly from a URL, flatten nested JSON, query SQL and NoSQL databases, import Excel sheets, or stream large files in memory-safe batches.
Once the data's loaded, you'll find simple ways to spot and fill in missing values, standardize categories that are off, clip outliers, normalize features, get rid of duplicates, and extract the year, month, or weekday from timestamps. You'll learn how to run quick analyses, like generating descriptive statistics, plotting histograms and correlation heatmaps, building pivot tables, creating scatter-matrix plots, and drawing time-series line charts to spot trends. You'll learn how to build polynomial features, compare MinMax, Standard, and Robust scaling, smooth data with rolling averages, apply PCA to reduce dimensions, and encode high-cardinality fields with sparse one-hot encoding using feature engineering recipes.
As for machine learning, you'll learn to put together end-to-end pipelines that handle imputation, scaling, feature selection, and modeling in one object, create custom transformers, automate hyperparameter searches with GridSearchCV, save and load your pipelines, and let SelectKBest pick the top features automatically. You'll learn how to test hypotheses with t-tests and chi-square tests, build linear and Ridge regressions, work with decision trees and random forests, segment countries using clustering, and evaluate models using MSE, classification reports, and ROC curves. And you'll finally get a handle on debugging and integration: fixing pandas merge errors, correcting NumPy broadcasting mismatches, and making sure your plots are consistent.
Le informazioni nella sezione "Riassunto" possono far riferimento a edizioni diverse di questo titolo.
Da: California Books, Miami, FL, U.S.A.
Condizione: New. Codice articolo I-9789349174993
Quantità: Più di 20 disponibili
Da: BuchWeltWeit Ludwig Meier e.K., Bergisch Gladbach, Germania
Taschenbuch. Condizione: Neu. This item is printed on demand - it takes 3-4 days longer - Neuware -This book's got a bunch of handy recipes for data science pros to get them through the most common challenges they face when using Python tools and libraries. Each recipe shows you exactly how to do something step-by-step. You can load CSVs directly from a URL, flatten nested JSON, query SQL and NoSQL databases, import Excel sheets, or stream large files in memory-safe batches.Once the data's loaded, you'll find simple ways to spot and fill in missing values, standardize categories that are off, clip outliers, normalize features, get rid of duplicates, and extract the year, month, or weekday from timestamps. You'll learn how to run quick analyses, like generating descriptive statistics, plotting histograms and correlation heatmaps, building pivot tables, creating scatter-matrix plots, and drawing time-series line charts to spot trends. You'll learn how to build polynomial features, compare MinMax, Standard, and Robust scaling, smooth data with rolling averages, apply PCA to reduce dimensions, and encode high-cardinality fields with sparse one-hot encoding using feature engineering recipes.As for machine learning, you'll learn to put together end-to-end pipelines that handle imputation, scaling, feature selection, and modeling in one object, create custom transformers, automate hyperparameter searches with GridSearchCV, save and load your pipelines, and let SelectKBest pick the top features automatically. You'll learn how to test hypotheses with t-tests and chi-square tests, build linear and Ridge regressions, work with decision trees and random forests, segment countries using clustering, and evaluate models using MSE, classification reports, and ROC curves. And you'll finally get a handle on debugging and integration: fixing pandas merge errors, correcting NumPy broadcasting mismatches, and making sure your plots are consistent.Key LearningsYou can load remote CSVs directly into pandas using read_csv, so you don't have to deal with manual downloads and file clutter.Use json_normalize to convert nested JSON responses into simple tables, making it a breeze to analyze.You can query relational and NoSQL databases directly from Python, and the results will merge seamlessly into Pandas.Find and fill in missing values using IGNSA(), forward-fill, and median strategies for all of your data over time.You can free up a lot of memory by turning string columns into Pandas' Categorical dtype.You can speed up computations with NumPy vectorization and chunked CSV reading to prevent RAM exhaustion.You can build feature pipelines using custom transformers, scaling, and automated hyperparameter tuning with GridSearchCV.Use regression, tree-based, and clustering algorithms to show linear, nonlinear, and group-specific vaccination patterns.Evaluate models using MSE, R , precision, recall, and ROC curves to assess their performance.Set up automated data retrieval with scheduled API pulls, cloud storage, Kafka streams, and GraphQL queries.Table of ContentData Ingestion from Multiple SourcesPreprocessing and Cleaning Complex DatasetsPerforming Quick Exploratory AnalysisOptimizing Data Structures and PerformanceFeature Engineering and TransformationBuilding Machine Learning PipelinesImplementing Statistical and Machine Learning TechniquesDebugging and TroubleshootingAdvanced Data Retrieval and Integration 144 pp. Englisch. Codice articolo 9789349174993
Quantità: 2 disponibili
Da: CitiRetail, Stevenage, Regno Unito
Paperback. Condizione: new. Paperback. This book's got a bunch of handy recipes for data science pros to get them through the most common challenges they face when using Python tools and libraries. Each recipe shows you exactly how to do something step-by-step. You can load CSVs directly from a URL, flatten nested JSON, query SQL and NoSQL databases, import Excel sheets, or stream large files in memory-safe batches.Once the data's loaded, you'll find simple ways to spot and fill in missing values, standardize categories that are off, clip outliers, normalize features, get rid of duplicates, and extract the year, month, or weekday from timestamps. You'll learn how to run quick analyses, like generating descriptive statistics, plotting histograms and correlation heatmaps, building pivot tables, creating scatter-matrix plots, and drawing time-series line charts to spot trends. You'll learn how to build polynomial features, compare MinMax, Standard, and Robust scaling, smooth data with rolling averages, apply PCA to reduce dimensions, and encode high-cardinality fields with sparse one-hot encoding using feature engineering recipes.As for machine learning, you'll learn to put together end-to-end pipelines that handle imputation, scaling, feature selection, and modeling in one object, create custom transformers, automate hyperparameter searches with GridSearchCV, save and load your pipelines, and let SelectKBest pick the top features automatically. You'll learn how to test hypotheses with t-tests and chi-square tests, build linear and Ridge regressions, work with decision trees and random forests, segment countries using clustering, and evaluate models using MSE, classification reports, and ROC curves. And you'll finally get a handle on debugging and integration: fixing pandas merge errors, correcting NumPy broadcasting mismatches, and making sure your plots are consistent.Key LearningsYou can load remote CSVs directly into pandas using read_csv, so you don't have to deal with manual downloads and file clutter.Use json_normalize to convert nested JSON responses into simple tables, making it a breeze to analyze.You can query relational and NoSQL databases directly from Python, and the results will merge seamlessly into Pandas.Find and fill in missing values using IGNSA(), forward-fill, and median strategies for all of your data over time.You can free up a lot of memory by turning string columns into Pandas' Categorical dtype.You can speed up computations with NumPy vectorization and chunked CSV reading to prevent RAM exhaustion.You can build feature pipelines using custom transformers, scaling, and automated hyperparameter tuning with GridSearchCV.Use regression, tree-based, and clustering algorithms to show linear, nonlinear, and group-specific vaccination patterns.Evaluate models using MSE, R2, precision, recall, and ROC curves to assess their performance.Set up automated data retrieval with scheduled API pulls, cloud storage, Kafka streams, and GraphQL queries.Table of ContentData Ingestion from Multiple SourcesPreprocessing and Cleaning Complex DatasetsPerforming Quick Exploratory AnalysisOptimizing Data Structures and PerformanceFeature Engineering and TransformationBuilding Machine Learning PipelinesImplementing Statistical and Machine Learning TechniquesDebugging and TroubleshootingAdvanced Data Retrieval and Integration This item is printed on demand. Shipping may be from our UK warehouse or from our Australian or US warehouses, depending on stock availability. Codice articolo 9789349174993
Quantità: 1 disponibili
Da: buchversandmimpf2000, Emtmannsberg, BAYE, Germania
Taschenbuch. Condizione: Neu. This item is printed on demand - Print on Demand Titel. Neuware -This book's got a bunch of handy recipes for data science pros to get them through the most common challenges they face when using Python tools and libraries. Each recipe shows you exactly how to do something step-by-step. You can load CSVs directly from a URL, flatten nested JSON, query SQL and NoSQL databases, import Excel sheets, or stream large files in memory-safe batches.Once the data's loaded, you'll find simple ways to spot and fill in missing values, standardize categories that are off, clip outliers, normalize features, get rid of duplicates, and extract the year, month, or weekday from timestamps. You'll learn how to run quick analyses, like generating descriptive statistics, plotting histograms and correlation heatmaps, building pivot tables, creating scatter-matrix plots, and drawing time-series line charts to spot trends. You'll learn how to build polynomial features, compare MinMax, Standard, and Robust scaling, smooth data with rolling averages, apply PCA to reduce dimensions, and encode high-cardinality fields with sparse one-hot encoding using feature engineering recipes.As for machine learning, you'll learn to put together end-to-end pipelines that handle imputation, scaling, feature selection, and modeling in one object, create custom transformers, automate hyperparameter searches with GridSearchCV, save and load your pipelines, and let SelectKBest pick the top features automatically. You'll learn how to test hypotheses with t-tests and chi-square tests, build linear and Ridge regressions, work with decision trees and random forests, segment countries using clustering, and evaluate models using MSE, classification reports, and ROC curves. And you'll finally get a handle on debugging and integration: fixing pandas merge errors, correcting NumPy broadcasting mismatches, and making sure your plots are consistent.Key LearningsYou can load remote CSVs directly into pandas using read_csv, so you don't have to deal with manual downloads and file clutter.Use json_normalize to convert nested JSON responses into simple tables, making it a breeze to analyze.You can query relational and NoSQL databases directly from Python, and the results will merge seamlessly into Pandas.Find and fill in missing values using IGNSA(), forward-fill, and median strategies for all of your data over time.You can free up a lot of memory by turning string columns into Pandas' Categorical dtype.You can speed up computations with NumPy vectorization and chunked CSV reading to prevent RAM exhaustion.You can build feature pipelines using custom transformers, scaling, and automated hyperparameter tuning with GridSearchCV.Use regression, tree-based, and clustering algorithms to show linear, nonlinear, and group-specific vaccination patterns.Evaluate models using MSE, R , precision, recall, and ROC curves to assess their performance.Set up automated data retrieval with scheduled API pulls, cloud storage, Kafka streams, and GraphQL queries.Table of ContentData Ingestion from Multiple SourcesPreprocessing and Cleaning Complex DatasetsPerforming Quick Exploratory AnalysisOptimizing Data Structures and PerformanceFeature Engineering and TransformationBuilding Machine Learning PipelinesImplementing Statistical and Machine Learning TechniquesDebugging and TroubleshootingAdvanced Data Retrieval and IntegrationLibri GmbH, Europaallee 1, 36244 Bad Hersfeld 144 pp. Englisch. Codice articolo 9789349174993
Quantità: 1 disponibili
Da: AHA-BUCH GmbH, Einbeck, Germania
Taschenbuch. Condizione: Neu. Neuware - This book's got a bunch of handy recipes for data science pros to get them through the most common challenges they face when using Python tools and libraries. Each recipe shows you exactly how to do something step-by-step. You can load CSVs directly from a URL, flatten nested JSON, query SQL and NoSQL databases, import Excel sheets, or stream large files in memory-safe batches.Once the data's loaded, you'll find simple ways to spot and fill in missing values, standardize categories that are off, clip outliers, normalize features, get rid of duplicates, and extract the year, month, or weekday from timestamps. You'll learn how to run quick analyses, like generating descriptive statistics, plotting histograms and correlation heatmaps, building pivot tables, creating scatter-matrix plots, and drawing time-series line charts to spot trends. You'll learn how to build polynomial features, compare MinMax, Standard, and Robust scaling, smooth data with rolling averages, apply PCA to reduce dimensions, and encode high-cardinality fields with sparse one-hot encoding using feature engineering recipes.As for machine learning, you'll learn to put together end-to-end pipelines that handle imputation, scaling, feature selection, and modeling in one object, create custom transformers, automate hyperparameter searches with GridSearchCV, save and load your pipelines, and let SelectKBest pick the top features automatically. You'll learn how to test hypotheses with t-tests and chi-square tests, build linear and Ridge regressions, work with decision trees and random forests, segment countries using clustering, and evaluate models using MSE, classification reports, and ROC curves. And you'll finally get a handle on debugging and integration: fixing pandas merge errors, correcting NumPy broadcasting mismatches, and making sure your plots are consistent.Key LearningsYou can load remote CSVs directly into pandas using read_csv, so you don't have to deal with manual downloads and file clutter.Use json_normalize to convert nested JSON responses into simple tables, making it a breeze to analyze.You can query relational and NoSQL databases directly from Python, and the results will merge seamlessly into Pandas.Find and fill in missing values using IGNSA(), forward-fill, and median strategies for all of your data over time.You can free up a lot of memory by turning string columns into Pandas' Categorical dtype.You can speed up computations with NumPy vectorization and chunked CSV reading to prevent RAM exhaustion.You can build feature pipelines using custom transformers, scaling, and automated hyperparameter tuning with GridSearchCV.Use regression, tree-based, and clustering algorithms to show linear, nonlinear, and group-specific vaccination patterns.Evaluate models using MSE, R , precision, recall, and ROC curves to assess their performance.Set up automated data retrieval with scheduled API pulls, cloud storage, Kafka streams, and GraphQL queries.Table of ContentData Ingestion from Multiple SourcesPreprocessing and Cleaning Complex DatasetsPerforming Quick Exploratory AnalysisOptimizing Data Structures and PerformanceFeature Engineering and TransformationBuilding Machine Learning PipelinesImplementing Statistical and Machine Learning TechniquesDebugging and TroubleshootingAdvanced Data Retrieval and Integration. Codice articolo 9789349174993
Quantità: 2 disponibili
Da: preigu, Osnabrück, Germania
Taschenbuch. Condizione: Neu. Python Data Science Cookbook | Practical solutions across fast data cleaning, processing, and machine learning workflows with pandas, NumPy, and scikit-learn | Taryn Voska | Taschenbuch | Englisch | 2025 | GitforGits | EAN 9789349174993 | Verantwortliche Person für die EU: Libri GmbH, Europaallee 1, 36244 Bad Hersfeld, gpsr[at]libri[dot]de | Anbieter: preigu Print on Demand. Codice articolo 133469731
Quantità: 5 disponibili