000 05526cam a2200637Ii 4500
001 9780429060939
003 FlBoTFG
005 20220531132352.0
006 m o d
007 cr cnu|||unuuu
008 190222t20192019flua ob 001 0 eng d
040 _aOCoLC-P
_beng
_erda
_epn
_cOCoLC-P
020 _a9780429060939
_q(electronic bk.)
020 _a0429060939
_q(electronic bk.)
020 _a9780429590511
_q(electronic bk.)
020 _a0429590512
_q(electronic bk.)
020 _a9780429588570
_q(electronic bk. : Mobipocket)
020 _a0429588577
_q(electronic bk. : Mobipocket)
020 _a9780429592454
_q(electronic bk. : PDF)
020 _a0429592450
_q(electronic bk. : PDF)
020 _z9780367183455
020 _z0367183455
024 8 _a10.1201/9780429060939
_2doi
035 _a(OCoLC)1087502351
_z(OCoLC)1088528690
_z(OCoLC)1089012949
035 _a(OCoLC-P)1087502351
050 4 _aQA76.9.B45
072 7 _aCOM
_x021000
_2bisacsh
072 7 _aCOM
_x000000
_2bisacsh
072 7 _aCOM
_x018000
_2bisacsh
072 7 _aUB
_2bicssc
082 0 4 _a005.7
_223
100 1 _aRaheem, Nasir,
_eauthor.
245 1 0 _aBig data :
_ba tutorial-based approach /
_cNasir Raheem.
250 _aFirst edition.
264 1 _aBoca Raton :
_bCRC Press,
_c[2019].
264 4 _c©2019
300 _a1 online resource :
_billustrations.
336 _atext
_btxt
_2rdacontent
337 _acomputer
_bc
_2rdamedia
338 _aonline resource
_bcr
_2rdacarrier
490 1 _aCRC focus
520 _a"This book explores the tools and techniques to bring about the marriage of structured and unstructured data. It focuses on Hadoop Distributed Storage and MapReduce Processing by implementing (i) Tools and Techniques of Hadoop Eco System, (ii) Hadoop Distributed File System Infrastructure, and (iii) efficient MapReduce processing. The book includes Use Cases and Tutorials to provide an integrated approach that answers the 'What', 'How', and 'Why' of Big Data"--
_cProvided by publisher.
505 0 _aCover; Half Title; Title Page; Copyright Page; Dedication; Contents; List of Tutorials; List of Figures/Illustrations; Foreword; Preface; Acknowledgements; Author; Chapter 1: Introduction to Big Data; OVERVIEW; RAPID GROWTH OF BIG DATA; BIG DATA DEFINITION; BIG DATA PROJECTS; BUSINESS VALUE OF BIG DATA; Chapter 2: Big Data Implementation; OVERVIEW; HIGH-LEVEL TASKS TO IMPLEMENT INFORMATICA BDM, CLOUDERA HIVE, AND TABLEAU; BIG DATA TRIGGERS DIGITAL TRANSFORMATION OF THE PRODUCTION MODEL; BIG DATA CHALLENGES AND ASSOCIATED USE CASES; HADOOP INFRASTRUCTURE: OVERVIEW
505 8 _aHADOOP INFRASTRUCTURE: DEFINEDHyperconverged Hadoop Infrastructure; Compute Hardware Components; Network Hardware Components; Storage Hardware Architecture and Components; HADOOP ECO SYSTEM; HADOOP: JVM FRAMEWORK; HADOOP DISTRIBUTED FILE PROCESSING; MAPREDUCE SOFTWARE; MAPREDUCE SOFTWARE INSTALLATION; MAPREDUCE PROCESSING; Chapter 3: Big Data Use Cases; OVERVIEW; BIG DATA USE CASE: HEALTH; BIG DATA USE CASE: MANUFACTURING; BIG DATA USE CASE: INSURANCE; Chapter 4: Big Data Migration; OVERVIEW; CHALLENGES IN MIGRATING ORACLE DATA USING SQOOP; WHERE IS SQOOP USED?; SQOOP COMMANDS
505 8 _aHIVE ARGUMENTS USED BY SQOOPAPACHE SQOOP ARCHITECTURE; APACHE SQOOP COMMAND LINE INTERFACE; Chapter 5: Big Data Ingestion, Integration, and Management; OVERVIEW; INFORMATICA: MATURE AND COMPREHENSIVE BIG DATA SOLUTION; INFORMATICA DATA INTEGRATION; Chapter 6: Big Data Repository; OVERVIEW; DATA REPOSITORY LAYER; HIVE BIG DATA WAREHOUSE; SLOWLY CHANGING DIMENSION IN HIVE; HIVE METADATA: DEFINITIONS; INTEGRATED USE OF DATA INTEGRATION, DATA MANAGEMENT, AND DATA VISUALIZATION TOOLS; Chapter 7: Big Data Visualization; OVERVIEW; VARIABLE TYPES; Numbers; Strings; Factors
505 8 _aSUCCESS FACTORS FOR TABLEAUTABLEAU: STEP FORWARD IN DATA ANALYTICS; TABLEAU CONNECTORS FOR DATA SOURCES; TABLEAU DATA ENGINE TUNING; TABLEAU TUNING FEATURES; Fast Interactive Query Engine; Strategically Utilize Live Connections versus Extracts; Curate Data from the Data Lake; Optimize Data Extracts; Customize Tableau Connection Performance; Chapter 8: Structured and Un-Structured Data Analytics; OVERVIEW; TEXT ANALYTICS AS MEANS TO EXTRACT VALUE FROM UN-STRUCTURED DATA; MAJOR PLAYERS IN TEXT ANALYTICS; Decision Maker; Domain Expert; Linguist; Data Scientists; Conclusion; FROM DATA TO ACTION
505 8 _aCONCLUSIONChapter 9: Data Virtualization; OVERVIEW; Conclusion: Flexibility and Agility; Pre-Installation Steps to Set Up Denodo Development Environment; CONCLUSION; Chapter 10: Cloud Computing; OVERVIEW; A QUICK GLANCE AT CLOUD COMPUTING; Software as a Service (SaaS); Platform as a Service (PaaS); Infrastructure as a Service (IaaS); CLOUD COMPUTING VERSUS HADOOP PROCESSING; CLOUD SERVICE MOST SUITED FOR BIG DATA; Infrastructure as a Service (IaaS); Advantages of IaaS; CONCLUSION; SELF-ASSESSMENT QUIZ; ANSWERS TO THE SELF-ASSESSMENT QUIZ; REFERENCES; INDEX
588 _aOCLC-licensed vendor bibliographic record.
650 0 _aBig data
_vProgrammed instruction.
650 7 _aCOMPUTERS / Databases / General
_2bisacsh
650 7 _aCOMPUTERS / General
_2bisacsh
650 7 _aCOMPUTERS / Data Processing / General
_2bisacsh
650 7 _aCOMPUTERS / Database Management / General
_2bisacsh
856 4 0 _3Taylor & Francis
_uhttps://www.taylorfrancis.com/books/9780429060939
856 4 2 _3OCLC metadata license agreement
_uhttp://www.oclc.org/content/dam/oclc/forms/terms/vbrl-201703.pdf
999 _c71383
_d71383