Описание: In this practical and comprehensive guide, author Martin Kleppmann helps you navigate this diverse landscape by examining the pros and cons of various technologies for processing and storing data.
Описание: This broad, deep, but not-too-technical guide introduces you to the fundamental principles of data science and walks you through the "data-analytic thinking" necessary for extracting useful knowledge and business value from the data you collect.
Автор: Brown Meta S. Название: Data Mining for Dummies ISBN: 1118893174 ISBN-13(EAN): 9781118893173 Издательство: Wiley Рейтинг: Цена: 5067.00 р. Наличие на складе: Есть у поставщика Поставка под заказ.
Описание: Delve into your data for the key to success Data mining is quickly becoming integral to creating value and business momentum.
Автор: Dan Linstedt Название: Building a Scalable Data Warehouse with Data Vault 2.0 ISBN: 0128025107 ISBN-13(EAN): 9780128025109 Издательство: Elsevier Science Рейтинг: Цена: 9262.00 р. Наличие на складе: Есть у поставщика Поставка под заказ.
Описание:
TheData Vault was invented by Dan Linstedt at the U.S. Department of Defense, and the standard has been successfully applied to data warehousing projects at organizations of different sizes, from small to large-size corporations. Due to its simplified design, which is adapted from nature, the Data Vault 2.0 standard helps prevent typical data warehousing failures.
"Building a Scalable Data Warehouse" covers everything one needs to know to create a scalable data warehouse end to end, including a presentation of the Data Vault modeling technique, which provides the foundations to create a technical data warehouse layer. The book discusses how to build the data warehouse incrementally using the agile Data Vault 2.0 methodology. In addition, readers will learn how to create the input layer (the stage layer) and the presentation layer (data mart) of the Data Vault 2.0 architecture including implementation best practices. Drawing upon years of practical experience and using numerous examples and an easy to understand framework, Dan Linstedt and Michael Olschimke discuss:
How to load each layer using SQL Server Integration Services (SSIS), including automation of the Data Vault loading processes.
Important data warehouse technologies and practices.
Data Quality Services (DQS) and Master Data Services (MDS) in the context of the Data Vault architecture.
Provides a complete introduction to data warehousing, applications, and the business context so readers can get-up and running fast
Explains theoretical concepts and provides hands-on instruction on how to build and implement a data warehouse
Demystifies data vault modeling with beginning, intermediate, and advanced techniques
Discusses the advantages of the data vault approach over other techniques, also including the latest updates to Data Vault 2.0 and multiple improvements to Data Vault 1.0
Описание: The data lake is a daring new approach for harnessing the power of big data technology and providing convenient self-service capabilities. But is it right for your company? This book is based on discussions with practitioners and executives from more than a hundred organizations.
Описание: The expanded edition of this practical book not only introduces you web scraping, but also serves as a comprehensive guide to scraping almost every type of data from the modern web.
Описание: Big Data Solution Architecture provides everyone from CIOs and COOs to lead architects and lead developers with the fundamental concepts of big data development. Authors Ted Malaska and Jonathan Seidman guide you through all the major components necessary to start, architect, and develop successful big data projects.
Описание: Learning about distributed systems, becoming familiar with technologies such as containers and functions, and knowing how to put everything together can be daunting. With this practical guide, you`ll get up to speed on patterns for building cloud native applications and best practices for common tasks such as messaging, eventing, and DevOps.
Описание: Highlights the pitfalls of data analysis and emphasizes the importance of using the appropriate metrics before making key decisions. Big data is often touted as the key to understanding almost every aspect of contemporary life. This critique of "information hubris" shows that even more important than data is finding the right metrics to evaluate it. The author, an expert in environmental design and city planning, examines the many ways in which we measure ourselves and our world. He dissects the metrics we apply to health, worker productivity, our children's education, the quality of our environment, the effectiveness of leaders, the dynamics of the economy, and the overall well-being of the planet. Among the areas where the wrong metrics have led to poor outcomes, he cites the fee-for-service model of health care, corporate cultures that emphasize time spent on the job while overlooking key productivity measures, overreliance on standardized testing in education to the detriment of authentic learning, and a blinkered focus on carbon emissions, which underestimates the impact of industrial damage to our natural world. He also examines various communities and systems that have achieved better outcomes by adjusting the ways in which they measure data. The best results are attained by those that have learned not only what to measure and how to measure it, but what it all means. By highlighting the pitfalls inherent in data analysis, this illuminating book reminds us that not everything that can be counted really counts.
Learn how to take full advantage of Apache Kafka, the distributed, publish-subscribe queue for handling real-time data feeds. With this comprehensive book, you ll understand how Kafka works and how it s designed. Authors Neha Narkhede, Gwen Shapira, and Todd Palino show you how to deploy production Kafka clusters; secure, tune, and monitor them; write rock-solid applications that use Kafka; and build scalable stream-processing applications.Learn how Kafka compares to other queues, and where it fits in the big data ecosystemDive into Kafka s internal designPick up best practices for developing applications that use KafkaUnderstand the best way to deploy Kafka in production monitoring, tuning, and maintenance tasksLearn how to secure a Kafka clusterGet detailed use-cases"
Автор: Chambers Bill, Zaharia Matei Название: Spark: The Definitive Guide: Big Data Processing Made Simple ISBN: 1491912219 ISBN-13(EAN): 9781491912218 Издательство: Wiley Рейтинг: Цена: 8869.00 р. Наличие на складе: Есть у поставщика Поставка под заказ.
Описание: Learn how to use, deploy, and maintain Apache Spark with this comprehensive guide, written by the creators of the open-source cluster-computing framework. With an emphasis on improvements and new features in Spark 2.0, authors Bill Chambers and Matei Zaharia break down Spark topics into distinct sections, each with unique goals.
Автор: Kunigk Jan, George Lars, Wilkinson Paul Название: Architecting Modern Data Platforms2 ISBN: 149196927X ISBN-13(EAN): 9781491969274 Издательство: Wiley Рейтинг: Цена: 11403.00 р. Наличие на складе: Есть у поставщика Поставка под заказ.
Описание: Ideal for enterprise architects, IT managers, application architects, and data engineers, this book shows you how to overcome the many challenges that emerge during Hadoop projects.
ООО "Логосфера " Тел:+7(495) 980-12-10 www.logobook.ru