TY - BOOK AU - Raheem,Nasir TI - Big data: a tutorial-based approach T2 - CRC focus SN - 9780429060939 AV - QA76.9.B45 U1 - 005.7 23 PY - 2019///] CY - Boca Raton PB - CRC Press KW - Big data KW - Programmed instruction KW - COMPUTERS / Databases / General KW - bisacsh KW - COMPUTERS / General KW - COMPUTERS / Data Processing / General KW - COMPUTERS / Database Management / General N1 - Cover; Half Title; Title Page; Copyright Page; Dedication; Contents; List of Tutorials; List of Figures/Illustrations; Foreword; Preface; Acknowledgements; Author; Chapter 1: Introduction to Big Data; OVERVIEW; RAPID GROWTH OF BIG DATA; BIG DATA DEFINITION; BIG DATA PROJECTS; BUSINESS VALUE OF BIG DATA; Chapter 2: Big Data Implementation; OVERVIEW; HIGH-LEVEL TASKS TO IMPLEMENT INFORMATICA BDM, CLOUDERA HIVE, AND TABLEAU; BIG DATA TRIGGERS DIGITAL TRANSFORMATION OF THE PRODUCTION MODEL; BIG DATA CHALLENGES AND ASSOCIATED USE CASES; HADOOP INFRASTRUCTURE: OVERVIEW; HADOOP INFRASTRUCTURE: DEFINEDHyperconverged Hadoop Infrastructure; Compute Hardware Components; Network Hardware Components; Storage Hardware Architecture and Components; HADOOP ECO SYSTEM; HADOOP: JVM FRAMEWORK; HADOOP DISTRIBUTED FILE PROCESSING; MAPREDUCE SOFTWARE; MAPREDUCE SOFTWARE INSTALLATION; MAPREDUCE PROCESSING; Chapter 3: Big Data Use Cases; OVERVIEW; BIG DATA USE CASE: HEALTH; BIG DATA USE CASE: MANUFACTURING; BIG DATA USE CASE: INSURANCE; Chapter 4: Big Data Migration; OVERVIEW; CHALLENGES IN MIGRATING ORACLE DATA USING SQOOP; WHERE IS SQOOP USED?; SQOOP COMMANDS; HIVE ARGUMENTS USED BY SQOOPAPACHE SQOOP ARCHITECTURE; APACHE SQOOP COMMAND LINE INTERFACE; Chapter 5: Big Data Ingestion, Integration, and Management; OVERVIEW; INFORMATICA: MATURE AND COMPREHENSIVE BIG DATA SOLUTION; INFORMATICA DATA INTEGRATION; Chapter 6: Big Data Repository; OVERVIEW; DATA REPOSITORY LAYER; HIVE BIG DATA WAREHOUSE; SLOWLY CHANGING DIMENSION IN HIVE; HIVE METADATA: DEFINITIONS; INTEGRATED USE OF DATA INTEGRATION, DATA MANAGEMENT, AND DATA VISUALIZATION TOOLS; Chapter 7: Big Data Visualization; OVERVIEW; VARIABLE TYPES; Numbers; Strings; Factors; SUCCESS FACTORS FOR TABLEAUTABLEAU: STEP FORWARD IN DATA ANALYTICS; TABLEAU CONNECTORS FOR DATA SOURCES; TABLEAU DATA ENGINE TUNING; TABLEAU TUNING FEATURES; Fast Interactive Query Engine; Strategically Utilize Live Connections versus Extracts; Curate Data from the Data Lake; Optimize Data Extracts; Customize Tableau Connection Performance; Chapter 8: Structured and Un-Structured Data Analytics; OVERVIEW; TEXT ANALYTICS AS MEANS TO EXTRACT VALUE FROM UN-STRUCTURED DATA; MAJOR PLAYERS IN TEXT ANALYTICS; Decision Maker; Domain Expert; Linguist; Data Scientists; Conclusion; FROM DATA TO ACTION; CONCLUSIONChapter 9: Data Virtualization; OVERVIEW; Conclusion: Flexibility and Agility; Pre-Installation Steps to Set Up Denodo Development Environment; CONCLUSION; Chapter 10: Cloud Computing; OVERVIEW; A QUICK GLANCE AT CLOUD COMPUTING; Software as a Service (SaaS); Platform as a Service (PaaS); Infrastructure as a Service (IaaS); CLOUD COMPUTING VERSUS HADOOP PROCESSING; CLOUD SERVICE MOST SUITED FOR BIG DATA; Infrastructure as a Service (IaaS); Advantages of IaaS; CONCLUSION; SELF-ASSESSMENT QUIZ; ANSWERS TO THE SELF-ASSESSMENT QUIZ; REFERENCES; INDEX N2 - "This book explores the tools and techniques to bring about the marriage of structured and unstructured data. It focuses on Hadoop Distributed Storage and MapReduce Processing by implementing (i) Tools and Techniques of Hadoop Eco System, (ii) Hadoop Distributed File System Infrastructure, and (iii) efficient MapReduce processing. The book includes Use Cases and Tutorials to provide an integrated approach that answers the 'What', 'How', and 'Why' of Big Data"-- UR - https://www.taylorfrancis.com/books/9780429060939 UR - http://www.oclc.org/content/dam/oclc/forms/terms/vbrl-201703.pdf ER -