Data warehouses consolidate various activities of a business and often form the backbone for generating reports that support important business decisions. Errors tend to creep into the data for a variety of reasons, including mistakes during input data collection and mismatches while merging data collected independently across different databases. Such errors often propagate into erroneous downstream reports and can negatively impact business decisions. Therefore, one of the critical challenges in maintaining large data warehouses is ensuring that the quality of the data remains high. The process of maintaining high data quality is commonly referred to as data cleaning.

In this book, we first discuss the goals of data cleaning. These goals are often not well defined and can call for different solutions in different scenarios. Toward clarifying them, we abstract out a common set of data cleaning tasks that frequently need to be addressed. This abstraction allows us to develop solutions for these common data cleaning tasks. We then discuss a few popular approaches for developing such solutions. In particular, we focus on an operator-centric approach for building a data cleaning platform. The operator-centric approach involves the development of customizable operators that can be used as building blocks for common solutions, much as the basic operators of relational algebra can be put together to build complex queries. Finally, we discuss the development of custom scripts that combine the basic data cleaning operators with relational operators to implement effective solutions for data cleaning tasks.
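The operator-centric approach described above can be illustrated with a minimal sketch: small, customizable cleaning operators composed with ordinary relational-style operations, the way relational algebra operators compose into queries. All names here (`normalize`, `similarity_join`, `jaccard`) are illustrative assumptions for this sketch, not the book's actual API.

```python
# Sketch of the operator-centric idea: composable cleaning operators.
# Operator and field names are hypothetical, chosen for illustration.

def normalize(records, field):
    """Cleaning operator: lowercase and strip whitespace in a field."""
    return [dict(r, **{field: r[field].strip().lower()}) for r in records]

def jaccard(a, b):
    """Token-set Jaccard similarity between two strings."""
    ta, tb = set(a.split()), set(b.split())
    return len(ta & tb) / len(ta | tb) if ta | tb else 0.0

def similarity_join(left, right, field, threshold):
    """Cleaning operator: pair records whose field values are similar."""
    return [(l, r) for l in left for r in right
            if jaccard(l[field], r[field]) >= threshold]

# A "custom script" composes the cleaning operators to find likely
# duplicates across two independently collected tables.
crm   = [{"name": " Acme Corp "}, {"name": "Globex"}]
sales = [{"name": "acme corp"},   {"name": "Initech"}]

matches = similarity_join(normalize(crm, "name"),
                          normalize(sales, "name"),
                          "name", threshold=0.8)
print(matches)  # only the two Acme records pair up
```

The point of the composition is that each operator stays generic and reusable; a different cleaning task would swap in a different similarity function or threshold without changing the surrounding script.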
Information in the "Summary" section may refer to different editions of this title.
Venky Ganti is the co-founder and CTO of Alation Inc., where he is developing technology to effectively search, understand, and analyze structured and semi-structured data. Prior to Alation, he was a member of the Google AdWords engineering team for a few years. He helped develop the Dynamic Search Ads (DSA) product, whose goal is to completely automate the configuration and maintenance of AdWords campaigns based on an advertiser's website and a few configuration parameters. The main technical challenge is to mine for appropriate keywords and automatically create high-quality ads which match the accuracy and quality of manually configured campaigns. Prior to Google, Venky was a senior researcher at Microsoft Research (MSR). While at MSR, he worked extensively on data cleaning and integration technologies. Some of the technologies he helped develop in this context are now part of Microsoft SQL Server Integration Services, the ETL platform of Microsoft SQL Server. He also worked on leveraging rich structured databases on products, movies, people, etc., to enrich the user experience for web search. Some of the technologies he helped develop are now part of Bing product search. He has a Ph.D. in database systems and data mining from the University of Wisconsin-Madison.

Anish Das Sarma is currently a Senior Research Scientist at Google (since May 2010), before which he was a Research Scientist at Yahoo (August 2009–April 2010). Prior to joining Yahoo Research, Anish did his Ph.D. in Computer Science at Stanford University, advised by Prof. Jennifer Widom. Anish received a B.Tech. in Computer Science and Engineering from the Indian Institute of Technology (IIT) Bombay in 2004, and an M.S. in Computer Science from Stanford University in 2006. Anish is a recipient of the Microsoft Graduate Fellowship, a Stanford University School of Engineering fellowship, and the IIT-Bombay Dr. Shankar Dayal Sharma Gold Medal.

Anish has written over 40 technical papers, filed over 10 patents, is an associate editor of SIGMOD Record, has served on the thesis committee of a Stanford Ph.D. student, and has served on numerous program committees. Two SIGMOD papers and one VLDB paper co-authored by Anish were selected among the best papers of the conference, with invitations to journals. While at Stanford, Anish co-founded Shout Velocity, a social tweet ranking system that was named a top-50 fbFund finalist for most promising upcoming start-up ideas.
Information in the "About this book" section may refer to different editions of this title.
From: Brook Bookstore On Demand, Napoli, NA, Italy
Condition: New. This is a print-on-demand item. Item code FITLJCFYTZ
Quantity: More than 20 available
From: California Books, Miami, FL, U.S.A.
Condition: New. Item code I-9783031007699
Quantity: More than 20 available
From: GreatBookPrices, Columbia, MD, U.S.A.
Condition: As New. Unread book in perfect condition. Item code 44545656
Quantity: More than 20 available
From: Ria Christie Collections, Uxbridge, United Kingdom
Condition: New. In English. Item code ria9783031007699_new
Quantity: More than 20 available
From: Chiron Media, Wallingford, United Kingdom
PF. Condition: New. Item code 6666-IUK-9783031007699
Quantity: 10 available
From: GreatBookPricesUK, Woodford Green, United Kingdom
Condition: New. Item code 44545656-n
Quantity: More than 20 available
From: BuchWeltWeit Ludwig Meier e.K., Bergisch Gladbach, Germany
Paperback. Condition: New. This item is printed on demand, which takes 3-4 days longer. New stock. 88 pp. English. Item code 9783031007699
Quantity: 2 available
From: GreatBookPricesUK, Woodford Green, United Kingdom
Condition: As New. Unread book in perfect condition. Item code 44545656
Quantity: More than 20 available
From: moluna, Greven, Germany
Condition: New. This is a print-on-demand item and will be printed for you after you order. Item code 608129155
Quantity: More than 20 available