+353-1-416-8900REST OF WORLD
+44-20-3973-8888REST OF WORLD
1-917-300-0470EAST COAST U.S
1-800-526-8630U.S. (TOLL FREE)

Principles and Practice of Big Data. Preparing, Sharing, and Analyzing Complex Information. Edition No. 2

  • Book

  • July 2018
  • Elsevier Science and Technology
  • ID: 4519426

Principles and Practice of Big Data: Preparing, Sharing, and Analyzing Complex Information, Second Edition updates and expands on the first edition, bringing a set of techniques and algorithms that are tailored to Big Data projects. The book stresses the point that most data analyses conducted on large, complex data sets can be achieved without the use of specialized suites of software (e.g., Hadoop), and without expensive hardware (e.g., supercomputers). The core of every algorithm described in the book can be implemented in a few lines of code using just about any popular programming language (Python snippets are provided).

Through the use of new multiple examples, this edition demonstrates that if we understand our data, and if we know how to ask the right questions, we can learn a great deal from large and complex data collections. The book will assist students and professionals from all scientific backgrounds who are interested in stepping outside the traditional boundaries of their chosen academic disciplines.

Please Note: This is an On Demand product, delivery may take up to 11 working days after payment has been received.

Table of Contents

1. Introduction 2. Providing Structure to Unstructured Data 3. Identification, Deidentification, and Reidentification 4. Metadata, Semantics, and Triples 5. Classifications and Ontologies 6. Introspection 7. Data Integration and Software Interoperability 8. Immutability and Immortality 9. Assessing the Adequacy of a Big Data Resource 10. Measurement 11. Indispensable Tips for Fast and Simple Big Data Analysis 12. Finding the Clues in Large Collections of Data 13. Using Random Numbers to Bring Your Big Data Analytic Problems Down to Size 14. Special Considerations in Big Data Analysis 15. Big Data Failures and How to Avoid (Some of) Them 16. Legalities 17. Data Sharing 18. Data Reanalysis: Much More Important than Analysis 19. Repurposing Big Data

Authors

Jules J. Berman Freelance author with expertise in informatics, computer programming, and cancer biology. Jules Berman holds two Bachelor of Science degrees from MIT (in Mathematics and in Earth and Planetary Sciences), a PhD from Temple University, and an MD from the University of Miami. He was a graduate researcher at the Fels Cancer Research Institute (Temple University) and at the American Health Foundation in Valhalla, New York. He completed his postdoctoral studies at the US National Institutes of Health, and his residency at the George Washington University Medical Center in Washington, DC. Dr. Berman served as Chief of anatomic pathology, surgical pathology, and cytopathology at the Veterans Administration Medical Center in Baltimore, Maryland, where he held joint appointments at the University of Maryland Medical Center and at the Johns Hopkins Medical Institutions. In 1998, he transferred to the US National Institutes of Health as a Medical Officer and as the Program Director for Pathology Informatics in the Cancer Diagnosis Program at the National Cancer Institute. Dr. Berman is a past President of the Association for Pathology Informatics and is the 2011 recipient of the Association's Lifetime Achievement Award. He is a listed author of more than 200 scientific publications and has written more than a dozen books in his three areas of expertise: informatics, computer programming, and pathology. Dr. Berman is currently a freelance writer.