Deep Learning for Genomics: Data-driven approaches for genomics applications in life sciences and biotechnology

Ebook609 pages4 hours

Deep Learning for Genomics: Data-driven approaches for genomics applications in life sciences and biotechnology

Name: Deep Learning for Genomics: Data-driven approaches for genomics applications in life sciences and biotechnology
Author: Upendra Kumar Devisetty
ISBN: 9781804613016

By Upendra Kumar Devisetty

Rating: 0 out of 5 stars

()

Read preview

About this ebook

Deep learning has shown remarkable promise in the field of genomics; however, there is a lack of a skilled deep learning workforce in this discipline. This book will help researchers and data scientists to stand out from the rest of the crowd and solve real-world problems in genomics by developing the necessary skill set. Starting with an introduction to the essential concepts, this book highlights the power of deep learning in handling big data in genomics. First, you’ll learn about conventional genomics analysis, then transition to state-of-the-art machine learning-based genomics applications, and finally dive into deep learning approaches for genomics. The book covers all of the important deep learning algorithms commonly used by the research community and goes into the details of what they are, how they work, and their practical applications in genomics. The book dedicates an entire section to operationalizing deep learning models, which will provide the necessary hands-on tutorials for researchers and any deep learning practitioners to build, tune, interpret, deploy, evaluate, and monitor deep learning models from genomics big data sets.
By the end of this book, you’ll have learned about the challenges, best practices, and pitfalls of deep learning for genomics.

Skip carousel

LanguageEnglish

PublisherPackt Publishing

Release dateNov 11, 2022

ISBN9781804613016

Author

Upendra Kumar Devisetty

Related authors

Skip carousel

Related to Deep Learning for Genomics

Related ebooks

Skip carousel

Interpretable Machine Learning with Python: Learn to build interpretable high-performance models with hands-on real-world examples
Ebook
Interpretable Machine Learning with Python: Learn to build interpretable high-performance models with hands-on real-world examples
bySerg Masís
Rating: 0 out of 5 stars
0 ratings
Active Machine Learning with Python: Refine and elevate data quality over quantity with active learning
Ebook
Active Machine Learning with Python: Refine and elevate data quality over quantity with active learning
byMargaux Masson-Forsythe
Rating: 0 out of 5 stars
0 ratings
Deep Learning with Python: A Comprehensive guide to Building and Training Deep Neural Networks using Python and popular Deep Learning Frameworks
Ebook
Deep Learning with Python: A Comprehensive guide to Building and Training Deep Neural Networks using Python and popular Deep Learning Frameworks
byBrian Murray
Rating: 0 out of 5 stars
0 ratings
The Deep Learning Architect's Handbook: Build and deploy production-ready DL solutions leveraging the latest Python techniques
Ebook
The Deep Learning Architect's Handbook: Build and deploy production-ready DL solutions leveraging the latest Python techniques
byEe Kin Chin
Rating: 0 out of 5 stars
0 ratings
Interpretable Machine Learning with Python: Build explainable, fair, and robust high-performance models with hands-on, real-world examples
Ebook
Interpretable Machine Learning with Python: Build explainable, fair, and robust high-performance models with hands-on, real-world examples
bySerg Masís
Rating: 0 out of 5 stars
0 ratings
Machine Learning Infrastructure and Best Practices for Software Engineers: Take your machine learning software from a prototype to a fully fledged software system
Ebook
Machine Learning Infrastructure and Best Practices for Software Engineers: Take your machine learning software from a prototype to a fully fledged software system
byMiroslaw Staron
Rating: 0 out of 5 stars
0 ratings
Practical Data Analysis: For small businesses, analyzing the information contained in their data using open source technology could be game-changing. All you need is some basic programming and mathematical skills to do just that.
Ebook
Practical Data Analysis: For small businesses, analyzing the information contained in their data using open source technology could be game-changing. All you need is some basic programming and mathematical skills to do just that.
byHector Cuesta
Rating: 0 out of 5 stars
0 ratings
Internet of Things (IoT) A Quick Start Guide: A to Z of IoT Essentials
Ebook
Internet of Things (IoT) A Quick Start Guide: A to Z of IoT Essentials
byChitra Lele
Rating: 0 out of 5 stars
0 ratings
Machine Learning for Imbalanced Data: Tackle imbalanced datasets using machine learning and deep learning techniques
Ebook
Machine Learning for Imbalanced Data: Tackle imbalanced datasets using machine learning and deep learning techniques
byAbhishek Kumar
Rating: 0 out of 5 stars
0 ratings
Synthetic Data for Machine Learning: Revolutionize your approach to machine learning with this comprehensive conceptual guide
Ebook
Synthetic Data for Machine Learning: Revolutionize your approach to machine learning with this comprehensive conceptual guide
byAbdulrahman Kerim
Rating: 0 out of 5 stars
0 ratings
Advanced Deep Learning for Engineers and Scientists: A Practical Approach
Ebook
Advanced Deep Learning for Engineers and Scientists: A Practical Approach
byKolla Bhanu Prakash
Rating: 0 out of 5 stars
0 ratings
Modern Time Series Forecasting with Python: Explore industry-ready time series forecasting using modern machine learning and deep learning
Ebook
Modern Time Series Forecasting with Python: Explore industry-ready time series forecasting using modern machine learning and deep learning
byManu Joseph
Rating: 0 out of 5 stars
0 ratings
Harnessing the Power of AI: A Guide to Making Technology Work for You
Ebook
Harnessing the Power of AI: A Guide to Making Technology Work for You
byRoy Hope
Rating: 0 out of 5 stars
0 ratings
Getting started with Deep Learning for Natural Language Processing: Learn how to build NLP applications with Deep Learning (English Edition)
Ebook
Getting started with Deep Learning for Natural Language Processing: Learn how to build NLP applications with Deep Learning (English Edition)
bySunil Patel
Rating: 0 out of 5 stars
0 ratings
Distributed Machine Learning with Python: Accelerating model training and serving with distributed systems
Ebook
Distributed Machine Learning with Python: Accelerating model training and serving with distributed systems
byGuanhua Wang
Rating: 0 out of 5 stars
0 ratings
Knowledge-Based Bioinformatics: From Analysis to Interpretation
Ebook
Knowledge-Based Bioinformatics: From Analysis to Interpretation
byGil Alterovitz
Rating: 0 out of 5 stars
0 ratings
Deep Learning with PyTorch: A practical approach to building neural network models using PyTorch
Ebook
Deep Learning with PyTorch: A practical approach to building neural network models using PyTorch
byVishnu Subramanian
Rating: 0 out of 5 stars
0 ratings
The Handbook of NLP with Gensim: Leverage topic modeling to uncover hidden patterns, themes, and valuable insights within textual data
Ebook
The Handbook of NLP with Gensim: Leverage topic modeling to uncover hidden patterns, themes, and valuable insights within textual data
byChris Kuo
Rating: 0 out of 5 stars
0 ratings
Neural Network Programming with Java
Ebook
Neural Network Programming with Java
bySouza Alan M.F.
Rating: 0 out of 5 stars
0 ratings
Reproducible Data Science with Pachyderm: Learn how to build version-controlled, end-to-end data pipelines using Pachyderm 2.0
Ebook
Reproducible Data Science with Pachyderm: Learn how to build version-controlled, end-to-end data pipelines using Pachyderm 2.0
bySvetlana Karslioglu
Rating: 0 out of 5 stars
0 ratings
Microservices Design Patterns in .NET: Making sense of microservices design and architecture using .NET Core
Ebook
Microservices Design Patterns in .NET: Making sense of microservices design and architecture using .NET Core
byTrevoir Williams
Rating: 0 out of 5 stars
0 ratings
Machine Learning in Biotechnology and Life Sciences: Build machine learning models using Python and deploy them on the cloud
Ebook
Machine Learning in Biotechnology and Life Sciences: Build machine learning models using Python and deploy them on the cloud
bySaleh Alkhalifa
Rating: 0 out of 5 stars
0 ratings
Hands-on Scikit-Learn for Machine Learning Applications: Data Science Fundamentals with Python
Ebook
Hands-on Scikit-Learn for Machine Learning Applications: Data Science Fundamentals with Python
byDavid Paper
Rating: 0 out of 5 stars
0 ratings
R Deep Learning Essentials.: A step-by-step guide to building deep learning models using TensorFlow, Keras, and MXNet
Ebook
R Deep Learning Essentials.: A step-by-step guide to building deep learning models using TensorFlow, Keras, and MXNet
byMark Hodnett
Rating: 0 out of 5 stars
0 ratings
Metaprogramming with Python: A programmer's guide to writing reusable code to build smarter applications
Ebook
Metaprogramming with Python: A programmer's guide to writing reusable code to build smarter applications
bySulekha AloorRavi
Rating: 0 out of 5 stars
0 ratings
Modern Computer Vision with PyTorch: A practical roadmap from deep learning fundamentals to advanced applications and Generative AI
Ebook
Modern Computer Vision with PyTorch: A practical roadmap from deep learning fundamentals to advanced applications and Generative AI
byV Kishore Ayyadevara
Rating: 0 out of 5 stars
0 ratings
Practical Guide to Applied Conformal Prediction in Python: Learn and apply the best uncertainty frameworks to your industry applications
Ebook
Practical Guide to Applied Conformal Prediction in Python: Learn and apply the best uncertainty frameworks to your industry applications
byValery Manokhin
Rating: 0 out of 5 stars
0 ratings
Applied Deep Learning: Design and implement your own Neural Networks to solve real-world problems (English Edition)
Ebook
Applied Deep Learning: Design and implement your own Neural Networks to solve real-world problems (English Edition)
byDr. Rajkumar Tekchandani
Rating: 0 out of 5 stars
0 ratings
Hands-On Genetic Algorithms with Python: Applying genetic algorithms to solve real-world deep learning and artificial intelligence problems
Ebook
Hands-On Genetic Algorithms with Python: Applying genetic algorithms to solve real-world deep learning and artificial intelligence problems
byEyal Wirsansky
Rating: 0 out of 5 stars
0 ratings
Machine Learning with Tensorflow: A Deeper Look at Machine Learning with TensorFlow
Ebook
Machine Learning with Tensorflow: A Deeper Look at Machine Learning with TensorFlow
byFrank Millstein
Rating: 0 out of 5 stars
0 ratings

Intelligence (AI) & Semantics For You

Skip carousel

Artificial Intelligence: A Guide for Thinking Humans
Ebook
Artificial Intelligence: A Guide for Thinking Humans
byMelanie Mitchell
Rating: 4 out of 5 stars
4/5
ChatGPT for Beginners: How to Make Money Online and 10x Your Productivity Using ChatGPT Even if You’re an Absolute Beginner (The Complete Up-to-Date ChatGPT Guide)
Ebook
ChatGPT for Beginners: How to Make Money Online and 10x Your Productivity Using ChatGPT Even if You’re an Absolute Beginner (The Complete Up-to-Date ChatGPT Guide)
byMatthew Hayes
Rating: 0 out of 5 stars
0 ratings
2084: Artificial Intelligence and the Future of Humanity
Ebook
2084: Artificial Intelligence and the Future of Humanity
byJohn C. Lennox
Rating: 4 out of 5 stars
4/5
So You Want to Start a Podcast: Finding Your Voice, Telling Your Story, and Building a Community That Will Listen
Ebook
So You Want to Start a Podcast: Finding Your Voice, Telling Your Story, and Building a Community That Will Listen
byKristen Meinzer
Rating: 3 out of 5 stars
3/5
Data Science from Scratch: The #1 Data Science Guide for Everything A Data Scientist Needs to Know: Python, Linear Algebra, Statistics, Coding, Applications, Neural Networks, and Decision Trees
Ebook
Data Science from Scratch: The #1 Data Science Guide for Everything A Data Scientist Needs to Know: Python, Linear Algebra, Statistics, Coding, Applications, Neural Networks, and Decision Trees
bySteven Cooper
Rating: 4 out of 5 stars
4/5
ChatGPT Money Machine 2024 - The Ultimate Chatbot Cheat Sheet to Go From Clueless Noob to Prompt Prodigy Fast! Complete AI Beginner’s Course to Catch the GPT Gold Rush Before It Leaves You Behind
Ebook
ChatGPT Money Machine 2024 - The Ultimate Chatbot Cheat Sheet to Go From Clueless Noob to Prompt Prodigy Fast! Complete AI Beginner’s Course to Catch the GPT Gold Rush Before It Leaves You Behind
byAlec Rowe
Rating: 0 out of 5 stars
0 ratings
Mastering ChatGPT: 21 Prompts Templates for Effortless Writing
Ebook
Mastering ChatGPT: 21 Prompts Templates for Effortless Writing
byCea West
Rating: 5 out of 5 stars
5/5
Summary of Super-Intelligence From Nick Bostrom
Ebook
Summary of Super-Intelligence From Nick Bostrom
bySummary Station
Rating: 5 out of 5 stars
5/5
ChatGPT For Dummies
Ebook
ChatGPT For Dummies
byPam Baker
Rating: 4 out of 5 stars
4/5
Midjourney Mastery - The Ultimate Handbook of Prompts
Ebook
Midjourney Mastery - The Ultimate Handbook of Prompts
byAndreea Todinca
Rating: 5 out of 5 stars
5/5
ChatGPT For Fiction Writing: AI for Authors
Ebook
ChatGPT For Fiction Writing: AI for Authors
byNova Leigh
Rating: 5 out of 5 stars
5/5
101 Midjourney Prompt Secrets
Ebook
101 Midjourney Prompt Secrets
byMarcus Byrne
Rating: 3 out of 5 stars
3/5
Creating Online Courses with ChatGPT | A Step-by-Step Guide with Prompt Templates
Ebook
Creating Online Courses with ChatGPT | A Step-by-Step Guide with Prompt Templates
byCea West
Rating: 4 out of 5 stars
4/5
AI for Educators: AI for Educators
Ebook
AI for Educators: AI for Educators
byMatt Miller
Rating: 5 out of 5 stars
5/5
Python for Beginners. A Smarter Way to Learn Python in 5 Days and Remember it Longer. With Easy Step by Step Guidance and Hands on Examples. (Python Crash Course-Programming for Beginners)
Ebook
Python for Beginners. A Smarter Way to Learn Python in 5 Days and Remember it Longer. With Easy Step by Step Guidance and Hands on Examples. (Python Crash Course-Programming for Beginners)
byArthur T. Brooks
Rating: 0 out of 5 stars
0 ratings
ChatGPT Side Hustles 2024 - Unlock the Digital Goldmine and Get AI Working for You Fast with More Than 85 Side Hustle Ideas to Boost Passive Income, Create New Cash Flow, and Get Ahead of the Curve
Ebook
ChatGPT Side Hustles 2024 - Unlock the Digital Goldmine and Get AI Working for You Fast with More Than 85 Side Hustle Ideas to Boost Passive Income, Create New Cash Flow, and Get Ahead of the Curve
byAlec Rowe
Rating: 0 out of 5 stars
0 ratings
Killer ChatGPT Prompts: Harness the Power of AI for Success and Profit
Ebook
Killer ChatGPT Prompts: Harness the Power of AI for Success and Profit
byGuy Hart-Davis
Rating: 2 out of 5 stars
2/5
Three Story Method
Ebook series
Three Story Method
byJ. Thorn
The Secrets of ChatGPT Prompt Engineering for Non-Developers
Ebook
The Secrets of ChatGPT Prompt Engineering for Non-Developers
byCea West
Rating: 5 out of 5 stars
5/5
Enterprise AI For Dummies
Ebook
Enterprise AI For Dummies
byZachary Jarvinen
Rating: 3 out of 5 stars
3/5
Rise of Generative AI and ChatGPT: Understand how Generative AI and ChatGPT are transforming and reshaping the business world (English Edition)
Ebook
Rise of Generative AI and ChatGPT: Understand how Generative AI and ChatGPT are transforming and reshaping the business world (English Edition)
byUtpal Chakraborty
Rating: 0 out of 5 stars
0 ratings
Neural Networks: A Practical Guide for Understanding and Programming Neural Networks and Useful Insights for Inspiring Reinvention
Ebook
Neural Networks: A Practical Guide for Understanding and Programming Neural Networks and Useful Insights for Inspiring Reinvention
bySteven Cooper
Rating: 4 out of 5 stars
4/5
The Business Case for AI: A Leader's Guide to AI Strategies, Best Practices & Real-World Applications
Ebook
The Business Case for AI: A Leader's Guide to AI Strategies, Best Practices & Real-World Applications
byKavita Ganesan
Rating: 0 out of 5 stars
0 ratings
Chat-GPT Income Ideas: Pioneering Monetization Concepts Utilizing Conversational AI for Profitable Ventures
Ebook
Chat-GPT Income Ideas: Pioneering Monetization Concepts Utilizing Conversational AI for Profitable Ventures
byThe Passive Income Strategist
Rating: 3 out of 5 stars
3/5
The Algorithm of the Universe (A New Perspective to Cognitive AI)
Ebook
The Algorithm of the Universe (A New Perspective to Cognitive AI)
byAncient Philosophy
Rating: 5 out of 5 stars
5/5
Mastering ChatGPT: Create Highly Effective Prompts, Strategies, and Best Practices to Go From Novice to Expert
Ebook
Mastering ChatGPT: Create Highly Effective Prompts, Strategies, and Best Practices to Go From Novice to Expert
byTJ Books
Rating: 3 out of 5 stars
3/5
AI Crash Course: A fun and hands-on introduction to machine learning, reinforcement learning, deep learning, and artificial intelligence with Python
Ebook
AI Crash Course: A fun and hands-on introduction to machine learning, reinforcement learning, deep learning, and artificial intelligence with Python
byHadelin de Ponteves
Rating: 0 out of 5 stars
0 ratings
Dancing with Qubits: How quantum computing works and how it can change the world
Ebook
Dancing with Qubits: How quantum computing works and how it can change the world
byRobert S. Sutor
Rating: 5 out of 5 stars
5/5
The ChatGPT Handbook
Ebook
The ChatGPT Handbook
byPA BOOKS
Rating: 0 out of 5 stars
0 ratings
Coding with AI For Dummies
Ebook
Coding with AI For Dummies
byChris Minnick
Rating: 0 out of 5 stars
0 ratings

Related podcast episodes

Skip carousel

EP 38: Big Data in genomics - why we need 'the cloud' and AI to make sense of it all with Dr Maria Chatzou Dunford: Genomic data, is big data - so how do we actually make sense of this huge amount of data? And why should we use 'the cloud’ to store and analyse it? We talk to Dr Maria Chatzou Dunford, CEO and Co-Founder of LifeBit, a company that wants to democratise analysis of genetic big data.
Podcast episode
EP 38: Big Data in genomics - why we need 'the cloud' and AI to make sense of it all with Dr Maria Chatzou Dunford: Genomic data, is big data - so how do we actually make sense of this huge amount of data? And why should we use 'the cloud’ to store and analyse it? We talk to Dr Maria Chatzou Dunford, CEO and Co-Founder of LifeBit, a company that wants to democratise analysis of genetic big data.
byThe Genetics Podcast
0 ratings
0% found this document useful
#146 Viren Jain: How Google's AI is Pioneering Brain Mapping Research: This episode is sponsored by Celonis ,the global leader in process mining. AI has landed and enterprises are adapting. To give customers slick experiences and teams the technology to deliver. The road is long, but you’re closer than you think. Your...
Podcast episode
#146 Viren Jain: How Google's AI is Pioneering Brain Mapping Research: This episode is sponsored by Celonis ,the global leader in process mining. AI has landed and enterprises are adapting. To give customers slick experiences and teams the technology to deliver. The road is long, but you’re closer than you think. Your...
byEye On A.I.
0 ratings
0% found this document useful
Why and how is AI taking over the tissue image analysis field? w/ Jeppe Thagaard, Visiopharm
Podcast episode
Why and how is AI taking over the tissue image analysis field? w/ Jeppe Thagaard, Visiopharm
byDigital Pathology Podcast
0 ratings
0% found this document useful
Balancing Software-Driven Processes and Human Curation to Unlock Genomics Intelligence with Genomenon
Podcast episode
Balancing Software-Driven Processes and Human Curation to Unlock Genomics Intelligence with Genomenon
byData in Biotech
0 ratings
0% found this document useful
The Role of Infrastructure in ML // Niels Bantilan // #197
Podcast episode
The Role of Infrastructure in ML // Niels Bantilan // #197
byMLOps.community
0 ratings
0% found this document useful
Improving Software Engineering in Biostatistics with Daniel Sabanés Bové
Podcast episode
Improving Software Engineering in Biostatistics with Daniel Sabanés Bové
byAxial Podcast
0 ratings
0% found this document useful
What to consider when choosing an image analysis solution for phenotyping? (part 3) w/ Regan Baird, Visiopharm
Podcast episode
What to consider when choosing an image analysis solution for phenotyping? (part 3) w/ Regan Baird, Visiopharm
byDigital Pathology Podcast
0 ratings
0% found this document useful
#183 Dr. Miga and Dr. Phillippy on the Telomere to Telomere Consortium: Telomere to Telomere Consortium (T2T) co-chairs Drs. Miga and Phillippy discuss their experience using T2T to complete the human genome sequence.
Podcast episode
#183 Dr. Miga and Dr. Phillippy on the Telomere to Telomere Consortium: Telomere to Telomere Consortium (T2T) co-chairs Drs. Miga and Phillippy discuss their experience using T2T to complete the human genome sequence.
byDNA Today: A Genetics Podcast
0 ratings
0% found this document useful
The Evolution of Genomic Analysis with Sapient
Podcast episode
The Evolution of Genomic Analysis with Sapient
byData in Biotech
0 ratings
0% found this document useful
eQMS in Academia: Practical Learning for Biomedical Engineering Students: Have you ever thought about the versatility of an eQMS? As it turns out, the use of one medical device eQMS solution in particular is extending across multiple sectors.In this episode of the Global Medical Device Podcast, Jon Speer talks to R...
Podcast episode
eQMS in Academia: Practical Learning for Biomedical Engineering Students: Have you ever thought about the versatility of an eQMS? As it turns out, the use of one medical device eQMS solution in particular is extending across multiple sectors.In this episode of the Global Medical Device Podcast, Jon Speer talks to R...
byGlobal Medical Device Podcast powered by Greenlight Guru
0 ratings
0% found this document useful
Understanding The Immune System With Data At ImmunAI: An interview with Guy Yachdav about the work that he and his team are doing at ImmunAI to help researchers and scientists understand the immune system through data and machine learning.
Podcast episode
Understanding The Immune System With Data At ImmunAI: An interview with Guy Yachdav about the work that he and his team are doing at ImmunAI to help researchers and scientists understand the immune system through data and machine learning.
byData Engineering Podcast
0 ratings
0% found this document useful
Privacy Engineering at CMU and Privacy Decision Making with Dr. Lorrie Cranor: Dr. Lorrie Cranor began her career in privacy 25 years ago and has been a professor at Carnegie Mellon University in the School of Computer Science for 19 years. Today, she serves as director and professor for the CMU privacy engineering program.In this ...
Podcast episode
Privacy Engineering at CMU and Privacy Decision Making with Dr. Lorrie Cranor: Dr. Lorrie Cranor began her career in privacy 25 years ago and has been a professor at Carnegie Mellon University in the School of Computer Science for 19 years. Today, she serves as director and professor for the CMU privacy engineering program.In this ...
byPartially Redacted: Data, AI, Security, and Privacy
0 ratings
0% found this document useful
Open Source Software as a Triumph of Information Hiding, Modularity, and Creating Optionality with Dr. Gail Murphy: In this newest episode of The Idealcast, Gene Kim speaks with Dr. Gail Murphy, Professor of Computer Science and Vice President of Research and Innovation at the University of British Columbia. She is also the co-founder, board member, and former Chi...
Podcast episode
Open Source Software as a Triumph of Information Hiding, Modularity, and Creating Optionality with Dr. Gail Murphy: In this newest episode of The Idealcast, Gene Kim speaks with Dr. Gail Murphy, Professor of Computer Science and Vice President of Research and Innovation at the University of British Columbia. She is also the co-founder, board member, and former Chi...
byThe Idealcast with Gene Kim by IT Revolution
0 ratings
0% found this document useful
Exploring Open-Source for Tissue Image Analysis and Data Science Business w/ Trevor McKee, Pathomics.io
Podcast episode
Exploring Open-Source for Tissue Image Analysis and Data Science Business w/ Trevor McKee, Pathomics.io
byDigital Pathology Podcast
0 ratings
0% found this document useful
Weakly supervised AI for pathology w/ Geert Litjens, RadboudUMC
Podcast episode
Weakly supervised AI for pathology w/ Geert Litjens, RadboudUMC
byDigital Pathology Podcast
0 ratings
0% found this document useful
AI Ingenuity – Dr. Lisa Amini, Director, MIT-IBM Watson AI Lab – The Future of Machine Learning and Natural Language Processing in AI-based Products and Structures: Dr. Lisa Amini is the director of IBM Research Cambridge, which includes the MIT-IBM Watson AI Lab. Watson is a complex question-answering computer system that is capable of providing answers to questions that are directed in natural language; it was...
Podcast episode
AI Ingenuity – Dr. Lisa Amini, Director, MIT-IBM Watson AI Lab – The Future of Machine Learning and Natural Language Processing in AI-based Products and Structures: Dr. Lisa Amini is the director of IBM Research Cambridge, which includes the MIT-IBM Watson AI Lab. Watson is a complex question-answering computer system that is capable of providing answers to questions that are directed in natural language; it was...
byFinding Genius Podcast
0 ratings
0% found this document useful
Machine Learning and Artificial Intelligence in the Clinical Microbiology Laboratory (JCM ed.): The idea of applying machine learning and digital pathology platforms to everyday workflows in the clinical microbiology laboratory has become increasing intriguing and appealing, especially as labs continue to optimize efficiency in the midst of...
Podcast episode
Machine Learning and Artificial Intelligence in the Clinical Microbiology Laboratory (JCM ed.): The idea of applying machine learning and digital pathology platforms to everyday workflows in the clinical microbiology laboratory has become increasing intriguing and appealing, especially as labs continue to optimize efficiency in the midst of...
byEditors in Conversation
0 ratings
0% found this document useful
Episode 155: "The Future of AI" with Dr. Barry Devereux: In this episode of the Project Management Paradise podcast, Aaron Murphy interviews Dr. Barry Devereux, an expert in artificial intelligence (AI) and data analytics. Dr. Devereux discusses his background in cognitive science and his research on...
Podcast episode
Episode 155: "The Future of AI" with Dr. Barry Devereux: In this episode of the Project Management Paradise podcast, Aaron Murphy interviews Dr. Barry Devereux, an expert in artificial intelligence (AI) and data analytics. Dr. Devereux discusses his background in cognitive science and his research on...
byProject Management Paradise
0 ratings
0% found this document useful
Dennis Murphy: The Challenges With IT and OT Convergence
Podcast episode
Dennis Murphy: The Challenges With IT and OT Convergence
byThe PrOTect OT Cybersecurity Podcast
0 ratings
0% found this document useful
#338: Site Selection for Clinical Trials
Podcast episode
#338: Site Selection for Clinical Trials
byGlobal Medical Device Podcast powered by Greenlight Guru
0 ratings
0% found this document useful
Why and how should digital pathology be implemented into clinical practice? w/ Ralf Huss
Podcast episode
Why and how should digital pathology be implemented into clinical practice? w/ Ralf Huss
byDigital Pathology Podcast
0 ratings
0% found this document useful
AI-powered digital diagnostic tools for medical, veterinary and environmental laboratories. How Techcyte uses AI for digital cytology and smears w/ Ben Cahoon, Techcyte
Podcast episode
AI-powered digital diagnostic tools for medical, veterinary and environmental laboratories. How Techcyte uses AI for digital cytology and smears w/ Ben Cahoon, Techcyte
byDigital Pathology Podcast
0 ratings
0% found this document useful
55: The Ever Changing Technology in Veterinary Medicine
Podcast episode
55: The Ever Changing Technology in Veterinary Medicine
byThe Vet Blast Podcast presented by dvm360
0 ratings
0% found this document useful
#140 Isabelle Guyon: The Future of AI and Support Vector Machines: This episode is sponsored by MindStudio by YouAi. MindStudio is the best way to build an AI business. Start driving some serious revenue before everyone else. Mind Studio allows you to use conversational language to program incredibly powerful AI...
Podcast episode
#140 Isabelle Guyon: The Future of AI and Support Vector Machines: This episode is sponsored by MindStudio by YouAi. MindStudio is the best way to build an AI business. Start driving some serious revenue before everyone else. Mind Studio allows you to use conversational language to program incredibly powerful AI...
byEye On A.I.
0 ratings
0% found this document useful
Virtual Clinical Trials with Mike Novotny: Today we’re in the midst of the pandemic and virtual clinical trials are likely going to be a big part of our post-COVID-19 world. In this interview, I had the pleasure of speaking with Mike Novotny about virtual trials i.e. decentralized...
Podcast episode
Virtual Clinical Trials with Mike Novotny: Today we’re in the midst of the pandemic and virtual clinical trials are likely going to be a big part of our post-COVID-19 world. In this interview, I had the pleasure of speaking with Mike Novotny about virtual trials i.e. decentralized...
byClinical Trial Podcast | Conversations with Clinical Research Experts
0 ratings
0% found this document useful
22: Unleashing Effective QbD Strategies to Master Cell Therapy with Shin Kawamata - Part 2
Podcast episode
22: Unleashing Effective QbD Strategies to Master Cell Therapy with Shin Kawamata - Part 2
bySmart Biotech Scientist | Master Bioprocess CMC Development, Biologics Manufacturing & Scale-up for Busy Scientists
0 ratings
0% found this document useful
Why Experimental Design in Biotech is Broken and How to Fix It w/ Markus Gershater
Podcast episode
Why Experimental Design in Biotech is Broken and How to Fix It w/ Markus Gershater
byData in Biotech
0 ratings
0% found this document useful
EP116 SBOMs: A Step Towards a More Secure Software Supply Chain: Guest: , PM focused on Software Supply Chain Security @ Google Cooked questions: Why is everyone talking about SBOMs all of a sudden? Why does this matter to a typical security leader? Some software vendors don’t want SBOM, and this reminds us of...
Podcast episode
EP116 SBOMs: A Step Towards a More Secure Software Supply Chain: Guest: , PM focused on Software Supply Chain Security @ Google Cooked questions: Why is everyone talking about SBOMs all of a sudden? Why does this matter to a typical security leader? Some software vendors don’t want SBOM, and this reminds us of...
byCloud Security Podcast by Google
0 ratings
0% found this document useful
What’s Your Story: Ranveer Chandra
Podcast episode
What’s Your Story: Ranveer Chandra
byMicrosoft Research Podcast
0 ratings
0% found this document useful
CTP 024: Using Medical Records to Pre-Qualify for Clinical Trials with Komathi Stem: Using Medical Records to Pre-Qualify for Clinical Trials with Komathi Stem The traditional model involves sponsors and CROs contracting with trial sites and hoping the sites will find and enroll eligible patients. Through her work at...
Podcast episode
CTP 024: Using Medical Records to Pre-Qualify for Clinical Trials with Komathi Stem: Using Medical Records to Pre-Qualify for Clinical Trials with Komathi Stem The traditional model involves sponsors and CROs contracting with trial sites and hoping the sites will find and enroll eligible patients. Through her work at...
byClinical Trial Podcast | Conversations with Clinical Research Experts
0 ratings
0% found this document useful

Skip carousel

The Deep Learning Revolution For Artificial Intelligence
Facility Management
Article
The Deep Learning Revolution For Artificial Intelligence
Mar 28, 2019
3 min read
How To Make Sense From And With AI ?
The European Business Review
Article
How To Make Sense From And With AI ?
Sep 25, 2021
4 min read
Generative AI: What Leaders Need To Know
Rotman Management
Article
Generative AI: What Leaders Need To Know
Jan 1, 2024
12 min read
Why We Need To Fear The Risk Of AI Model Collapse
Evening Standard
Article
Why We Need To Fear The Risk Of AI Model Collapse
Dec 17, 2023
4 min read
The Gene Business
Business Today
Article
The Gene Business
Feb 20, 2020
9 min read
The Changing Scenarios of Digital Enterprises in 21st Century World
Techfastly
Article
The Changing Scenarios of Digital Enterprises in 21st Century World
Dec 1, 2021
5 min read
Opinion: The FDA Needs To Set Standards For Using Artificial Intelligence In Drug Development
STAT
Article
Opinion: The FDA Needs To Set Standards For Using Artificial Intelligence In Drug Development
Nov 7, 2019
Poorly constructed AI algorithms for drug discovery and testing have the potential to cause harm. The FDA should play an important role in ensuring that AI-based drug development tools meet…
3 min read
Adoption of Cognitive Computing Across Various Industries
Techfastly
Article
Adoption of Cognitive Computing Across Various Industries
Dec 1, 2021
5 min read
PEOPLE ASSESSMENT in the Digital Age
The European Business Review
Article
PEOPLE ASSESSMENT in the Digital Age
May 25, 2021
8 min read
Federated Learning Uses The Data Right On Our Devices
Futurity
Article
Federated Learning Uses The Data Right On Our Devices
Jul 21, 2022
2 min read
You Had Questions For David Liu About CRISPR, Prime Editing, And Advice To Young Scientists. He Has Answers
STAT
Article
You Had Questions For David Liu About CRISPR, Prime Editing, And Advice To Young Scientists. He Has Answers
Nov 6, 2019
You had questions for David Liu about CRISPR, prime editing, and advice to young scientists. He has answers.
17 min read
Getting The edge
The European Business Review
Article
Getting The edge
Feb 25, 2021
7 min read
Challenging But Necessary: The AI Balancing Problem
Forbes Africa
Article
Challenging But Necessary: The AI Balancing Problem
Aug 8, 2024
Artificial intelligence (AI) continues transforming many industries, providing unprecedented opportunities for innovation and efficiency. However, these advancements bring complex challenges that necessitate a delicate balancing act. Developers, poli
3 min read
Things Get Strange When AI Starts Training Itself
The Atlantic
Article
Things Get Strange When AI Starts Training Itself
Feb 16, 2024
7 min read
5 QUESTIONS with: Diahan Southard -DNA Expert
Family Tree
Article
5 QUESTIONS with: Diahan Southard -DNA Expert
Nov 27, 2023
2 min read
Cambridge-1 And The Future Of Medicine
PC Pro Magazine
Article
Cambridge-1 And The Future Of Medicine
Sep 9, 2021
7 min read
In Tune With Technology
India Today
Article
In Tune With Technology
Jul 18, 2019
We are the beginning of the Fourth Industrial Revolution. Developments in genetics, artificial intelligence, robotics, nanotechnology, 3D printing and biotechnology are happening around us. This will lay the foundation for a revolution more comprehen
1 min read
Is Artificial Intelligence Permanently Inscrutable?: Despite new biology-like tools, some insist interpretation is impossible.
Nautilus
Article
Is Artificial Intelligence Permanently Inscrutable?: Despite new biology-like tools, some insist interpretation is impossible.
Sep 1, 2016
Dmitry Malioutov can’t say much about what he built. As a research scientist at IBM, Malioutov spends part of his time building machine learning systems that solve difficult problems faced by IBM’s corporate clients. One such program was meant for a
13 min read
Is Artificial Intelligence Permanently Inscrutable?
Nautilus
Article
Is Artificial Intelligence Permanently Inscrutable?
Sep 1, 2016
Dmitry Malioutov can’t say much about what he built. As a research scientist at IBM, Malioutov spends part of his time building machine learning systems that solve difficult problems faced by IBM’s corporate clients. One such program was meant for a
13 min read
$3 Million Prize Won For An AI That Predicts Every Protein’s Structure
How It Works
Article
$3 Million Prize Won For An AI That Predicts Every Protein’s Structure
Oct 27, 2022
2 min read
Synthetic Data As A Double-Edged Sword In Africa's AI Revolution
Forbes Africa
Article
Synthetic Data As A Double-Edged Sword In Africa's AI Revolution
Sep 29, 2023
Artificial intelligence (AI) is transforming companies and economies worldwide, including in Africa. Data is an essential component in the training of AI systems. Unfortunately, the lack of accurate, high-quality data is a significant impediment in A
3 min read
AI And Design: Questions Of Ethics
Architecture Australia
Article
AI And Design: Questions Of Ethics
Mar 4, 2024
Artificial intelligence (AI) is a very old idea, but the term AI and the field of AI as it relates to modern programmable digital computing have taken their contemporary forms in the past 70 years.1Today, we interact with AI technologies constantly,
5 min read
Arnab PANDEY
Techfastly
Article
Arnab PANDEY
Apr 1, 2021
11 min read
Cognitive Agents and Reinforcement of User Experience
Techfastly
Article
Cognitive Agents and Reinforcement of User Experience
Dec 1, 2021
3 min read
DeepMind’s AI Can ‘Predict How All Of Life’s Molecules Interact With Each Other’
Evening Standard
Article
DeepMind’s AI Can ‘Predict How All Of Life’s Molecules Interact With Each Other’
May 8, 2024
2 min read
Integrated Workplace Management Systems
Facility Management
Article
Integrated Workplace Management Systems
Dec 23, 2018
Property and facilities management are data-rich operating worlds. This is becoming even more complex as the Internet of Things (IoT) provides the capability to imbed sensors and diagnostic tools to monitor the use and performance of everything in re
4 min read
Trend Micro Maximum Security
PC Pro Magazine
Article
Trend Micro Maximum Security
Mar 9, 2023
SCORE PRICE (5 devices) First year, £25 (£30 inc VAT), renewal £88 (£105 inc VAT) from trendmicro.com/en_gb Trend Micro packs a lot of features into Maximum Security – many of them useful – but not enough to justify its price. The usual range of real
1 min read
Opinion: CRISPR-Cas9 Commercialization May Be Slowed By Delivery And Manufacturing Challenges
STAT
Article
Opinion: CRISPR-Cas9 Commercialization May Be Slowed By Delivery And Manufacturing Challenges
Feb 1, 2019
4 min read
Generativity: Driving The Promise Of Generative AI
The European Business Review
Article
Generativity: Driving The Promise Of Generative AI
May 31, 2023
7 min read
IoT Ransomware - How Healthcare Organizations Can Protect Themselves
Techfastly
Article
IoT Ransomware - How Healthcare Organizations Can Protect Themselves
Aug 2, 2021
5 min read

Related categories

Skip carousel

Reviews for Deep Learning for Genomics

Rating: 0 out of 5 stars

0 ratings

0 ratings0 reviews

Book preview

Deep Learning for Genomics - Upendra Kumar Devisetty

cover.png

BIRMINGHAM—MUMBAI

Deep Learning for Genomics

All rights reserved. No part of this book may be reproduced, stored in a retrieval system, or transmitted in any form or by any means, without the prior written permission of the publisher, except in the case of brief quotations embedded in critical articles or reviews.

Every effort has been made in the preparation of this book to ensure the accuracy of the information presented. However, the information contained in this book is sold without warranty, either express or implied. Neither the author(s), nor Packt Publishing or its dealers and distributors, will be held liable for any damages caused or alleged to have been caused directly or indirectly by this book.

Packt Publishing has endeavored to provide trademark information about all of the companies and products mentioned in this book by the appropriate use of capitals. However, Packt Publishing cannot guarantee the accuracy of this information.

Publishing Product Manager: Dhruv Jagdish Kataria

Content Development Editor: Priyanka Soam

Technical Editor: Rahul Limbachiya

Copy Editor: Safis Editing

Project Coordinator: Farheen Fathima

Proofreader: Safis Editing

Indexer: Rekha Nair

Production Designer: Mohamed Huzair

Marketing Coordinators: Shifa Ansari, Abeer Riyaz Dawe

First published: October 2022

Production reference: 1311022

Published by Packt Publishing Ltd.

Livery Place

35 Livery Street

Birmingham

B3 2PB, UK.

ISBN 978-1-80461-544-7

www.packt.com

Contributors

About the author

Upendra Kumar Devisetty has a Ph.D. in agriculture and over 12 years of experience working in Next-Generation Sequencing. He has a deep background in genomics and bioinformatics with a specialization in applying predictive analytics across a varied set of genomics problems in life sciences. Dr. Devisetty is currently working as a senior data science manager at Greenlight Biosciences, where he leads a team of bioinformatics scientists and data scientists to support the various bioinformatics and data science projects at Greenlight Biosciences with a mission to create mRNA-based solutions that can provide a cleaner environment and healthier people.

About the reviewer

Urminder Singh is a computer scientist and bioinformatician. His diverse research interests include understanding novel gene evolution, cancer genomics, machine learning in medicine, sociogenomics, and algorithms for big heterogeneous data. You can find him online at urmi-21.github.io.

Table of Contents

Preface

Part 1 – Machine Learning in Genomics

Introducing Machine Learning for Genomics

What is machine learning?

Why machine learning for genomics?

Machine learning for genomics in life sciences and biotechnology

Exploring machine learning software

Python programming language

Visualization

Biopython

Scikit-learn

Summary

Genomics Data Analysis

Technical requirements

Installing Biopython

Matplotlib

What is a genome?

Genome sequencing

Sanger sequencing of nucleic acids

Evolution of next-generation sequencing

Analysis of genomic data

Steps in genomics data analysis

Introduction to Biopython for genomic data analysis

What is Biopython?

Genomic data analysis use case – Sequence analysis of Covid-19

Calculating GC content

Calculating nucleotide content

Dinucleotide content

Modeling

Motif finder

Summary

Machine Learning Methods for Genomic Applications

Technical requirements

Python packages

ML libraries

Genomics big data

Supervised and unsupervised ML

Supervised ML

Unsupervised ML

ML for genomics

The basic workflow of ML in genomics

An ML use case for genomics – Disease prediction

Data collection

Data preprocessing

EDA

Data transformation

Data splitting

Model training

Model evaluation

ML challenges in genomics

Summary

Part 2 – Deep Learning for Genomic Applications

Deep Learning for Genomics

Understanding what deep learning is and how it works

Neural network definition

Anatomy of deep neural networks

Key concepts of DNNs

An example of how neural networks work

DNN architectures

DNNs for genomics

Deep learning workflow for genomics

Broad application of DNNs in genomics

Protein structure predictions

Regulatory genomics

Gene regulatory networks

Single-cell RNA sequencing

Introducing deep learning algorithms and Python libraries

General deep learning libraries

Deep learning libraries for genomics

Summary

Introducing Convolutional Neural Networks for Genomics

Introduction to CNNs

What are CNNs?

Transfer Learning

CNNs for genomics

Applications of CNNs in genomics

DeepBind

DeepInsight

DeepChrome

DeepVariant

Summary

Recurrent Neural Networks in Genomics

What are RNNs?

Introducing RNNs

How do RNNs work?

Different RNN architectures

Bidirectional RNNs (BiLSTM )

LSTMs and GRUs

Different types of RNNs

Applications and use cases of RNNs in genomics

DeepNano

ProLanGo

DanQ

Understanding RNNs through Transcription Factor Binding Site (TFBS) predictions

Summary

Unsupervised Deep Learning with Autoencoders

What is unsupervised DL?

Types of unsupervised DL

Clustering

Anomaly detection

Association

What are autoencoders?

Properties of autoencoders

How do autoencoders work?

Architecture of autoencoders

Types of autoencoders

Autoencoders for genomics

Gene expression

Use case – Predicting gene expression from TCGA pan-cancer RNA-Seq data using denoising autoencoders

Summary

GANs for Improving Models in Genomics

What are GANs?

Differences between Discriminative and Generative models

Intuition about GANs

How do GANs work?

Challenges working with genomics datasets

What is synthetic data?

How can GANs help improve models?

Practical applications of GANs in genomics

Analysis of ScRNA-Seq data

Generation of DNA

Using GANs for augmenting population-scale genomics data

Summary

Part 3 – Operationalizing models

Building and Tuning Deep Learning Models

Technical requirements

DL life cycle

Data processing

Data collection

Data wrangling

Feature engineering

Developing models

Selecting an appropriate algorithm

Model training

Tuning the models

Hyperparameter tuning

Hyperparameter tuning libraries

Classification metrics or performance statistics

Visualizing performance

Regression metrics

Use case – Predicting the binding site location of the JunD TF

Framing the TFBS prediction problem in terms of DL

Processing the data

Model training

Summary

Model Interpretability in Genomics

What is model interpretability?

Black-box model interpretability

Unlocking business value from model interpretability

Better business decisions

Building trust

Profitability

Model interpretability methods in genomics

Partial dependence plot

Individual conditional expectation

Permuted feature importance

Global surrogate

LIME

Shapley value

ExSum

Saliency map

Use case – Model interpretability for genomics

Data collection

Feature extraction

Target labels

Train-test split

Creating a CNN architecture

Summary

Model Deployment and Monitoring

Technical requirements

Streamlit

Hugging Face

Introducing model deployment

Steps in model deployment

Types of model deployment

Deploying models as services

A use case for deploying a DL model as a web service – building a Streamlit application of the CNN model

Monitoring models using advanced tools

Why monitor models?

Reasons for model degradation

How to monitor DL models

Advanced tools for model monitoring

Addressing drifts

Summary

Challenges, Pitfalls, and Best Practices for Deep Learning in Genomics

Deep learning challenges regarding genomics

Lack of flexible tools

Fewer biological samples

Computational resource requirements

Expertise in DL frameworks

Lack of high-quality labeled data

Lack of model interpretability

Common pitfalls for applying deep learning to genomics

Confounding

Data leakage

Imbalanced data

Improper model comparisons

Best practices for applying deep learning to genomics

Understand the problem and know your data better

A simple model for a simple problem

Establish a baseline for your model

Ensure reproducibility

Using pre-existing models for genomics

Do not reinvent the rule

Tune hyperparameters automatically

Focus on feature engineering

Normalize the data

Always perform model interpretation

Avoid overfitting

Summary

Index

Other Books You May Enjoy

Preface

Deep learning is the subset of machine learning based on artificial neural networks with representative learning using vast amounts of data. Machine learning is a subcomponent of artificial intelligence, which includes sophisticated algorithms that enable machines to mimic human intelligence to perform human tasks automatically. Both deep learning and machine learning help automatically detect meaningful patterns from data without explicit programming. Machine learning and deep learning have completely changed the way that we live these days. We rely on these so much that it’s hard to imagine a day without using any of these in some way or another, whether it is via the spam filtering of emails, product recommendations, or speech recognition. Both machine learning and specifically deep learning have been adopted by the scientific community in areas such as biology, genomics, bioinformatics, and computational biology. High-throughput technologies (HTS) such as next-generation sequencing (NGS) have made a significant contribution to genomics to study complex biological phenomena at a single-base-pair resolution on an unprecedented scale, facilitating an era of big data genomics. To get meaningful and novel biological insights from this big data, most of the algorithms are currently based on machine learning and, lately, deep learning methodologies to provide higher levels of accuracy in specific tasks related to genomics than state-of-the-art rule-based algorithms. Given the growing trend in the perception and application of machine learning and deep learning in genomics, research professionals, scientists, and managers require a good understanding of this exciting field to equip them with the necessary tools, technologies, and general guidelines to assist them in the selection of machine learning and deep learning methods for handling genomics data and accelerating data-driven decision-making in industries related to life sciences and biotechnology.

Throughout this book, we will learn how to apply deep learning approaches to solve real-world problems in genomics, interpret biological insights from deep learning models built from genomic datasets, and finally, operationalize deep learning models using open source tools to enable predictions for end users.

Who is this book for?

This book aims to practically introduce machine learning and deep learning for genomic applications that can transform genomics data into novel biological insights. It provides both the theoretical fundamentals and hands-on sections to give a taste of how machine learning and deep learning can be leveraged in real-world applications in the life sciences and biotech industries. This book covers a range of topics that are not currently available in other textbooks. The book also includes the challenges, pitfalls, and best practices when applying machine learning and deep learning to real-world scenarios. Each chapter of the book has code written in Python with industry-standard machine learning and deep learning libraries and frameworks such as Keras that the audience can reproduce in their working environment. This book is designed to cater to the needs of researchers, bioinformaticians, and data scientists in both academia and industry who want to leverage machine learning and deep learning technologies in genomic applications to extract insights from sets of big data. Managers and leaders who are already established in the life sciences and biotechnology sectors will not only find this book useful but can also adopt these methodologies to identify patterns, come up with predictions, and thereby contribute to data-driven decision-making in their respective companies.

The book is divided into three different parts. The first part introduces the fundamentals of genomic data analysis and machine learning. In this part, we will introduce the basic concept of genomic data analysis and discuss what machine learning is and why it is important for genomics and what value machine learning will bring to the life sciences and biotechnology industries. The second part will transition the readers from machine learning to deep learning and introduce them to the basic concepts of deep learning and diverse deep learning algorithms, using real-world examples to transform raw genomics data into biological insights. The final part will describe how to operationalize deep learning models using open source tools to enable predictions for end users. In this part, you will learn how to build and tune state-of-the-art machine learning models using Python and industry-standard libraries to derive biological insights from large amounts of multimodal genomic datasets and how to deploy these models on several cloud platforms such as AWS and Azure. The last chapter in the final part is fully dedicated to the current challenges for deep learning approaches to genomics and the potential pitfalls and how to avoid them using best practices.

What this book covers

Chapter 1, Introducing Machine Learning for Genomics, provides a brief history of the field of genomics and the practical application of machine learning methods to genomics, in addition to some of the technologies that this book will use.

Chapter 2, Genomics Data Analysis, gives readers a quick primer on data analysis in genomics. Using the Python programming language, readers will be able to make sense of the vast amounts of genomics data available and extract biological insights.

Chapter 3, Machine Learning Methods for Genomic Applications, introduces the reader to the two most important machine learning methods (supervised and unsupervised) and some of the important elements of standard machine learning pipelines. It also includes the practical real-world applications of supervised and unsupervised algorithms for genomics data analysis in the life sciences and biotechnology industries.

Chapter 4, Deep Learning for Genomics, will teach the reader about the fundamental concepts of deep learning, different types of deep learning models, and different deep learning Python libraries.

Chapter 5, Introducing Convolutional Neural Networks for Genomics, gives the reader a taste of Convolutional Neural Networks (CNNs), a type of deep neural network that is primarily used for sequence data, and shows how CNNs have superior performance compared to other deep learning methods.

Chapter 6, Recurrent Neural Networks in Genomics, introduces reinforcement learning techniques such as Recurrent Neural Networks (RNNs) and LSTMs and shows how they are currently being applied in several applications.

Chapter 7, Unsupervised Deep Learning with Autoencoders, introduces unsupervised deep learning, different methods of unsupervised deep learning, specifically Autoencoders, and its application in genomics.

Chapter 8, GANs for Improving Models in Genomics, introduces Generative Adversarial Networks (GANs) and how they can be used to improve deep neural networks trained on genomics datasets for predictive modeling.

Chapter 9, Building and Tuning Deep Learning Models, describes how to build and tune machine learning and deep learning models and deploy the final models across various computational systems and several platforms.

Chapter 10, Model Interpretability in Genomics, introduces the reader to how to interpret machine learning and deep learning models. The model interpretability introduced here helps readers to understand a model’s decision and why businesses are interested in model interpretability for creating trust, gaining profitability, and so on.

Chapter 11, Model Deployment and Monitoring, teaches the reader how to take the model they built on Google Colab and deploy it for predictions using open source tools such as Streamlit and Hugging Face. In addition, this chapter also describes how to monitor models using advanced tools and how monitoring is a key metric for businesses.

Chapter 12, Challenges, Pitfalls, and Best Practices for Deep Learning in Genomics, informs the reader of the challenges and pitfalls associated with applying machine learning and deep learning methodologies to genomics applications. It also covers the best practices for building end-to-end machine learning and deep learning models and applying them to genomic datasets.

To get the most out of this book

The book aims to keep it self-contained as possible. To extract the maximum value out of this book, a basic to intermediate knowledge of Python programming is recommended and a background in genomics, statistics, and bioinformatics and some knowledge of data science is a must. In addition, readers are expected to know the basics of machine learning and associated machine learning algorithms, such as regression and classification. The book provides a hands-on approach to implementation and associated deep learning methodologies that will have you up-and-running and productive in no time. At the end of the book, you will be able to put your knowledge to work with this practical guide.

If you are using the digital version of this book, we advise you to type the code yourself or access the code from the book’s GitHub repository. This will ensure you avoid any potential error related to copying and pasting of code.

Download the example code files

You can download the example code files for this book from GitHub at https://github.com/PacktPublishing/Deep-Learning-for-Genomics-. Any updates to the code will be reflected in the GitHub repository. We also have other code bundles from our rich catalog of books and videos available at: https://github.com/PacktPublishing/. Check them out!

Conventions used

There are several text conventions used throughout this book.

Code in text: Indicates code words in the text, folder names, filenames, file extensions, pathnames, dummy URLs, user input, and Twitter handles. Here is an example: Mount the downloaded WebStorm-10*.dmg disk image file as another disk in your system.

A block of code is set as follows:

# covid19_features.py

from Bio import SeqIO

When we wish to draw your attention to a particular part of a code block, the relevant lines or items are set in bold: First, import all the relevant libraries:

>>> from Bio import SeqIO

Any command-line input or output is written as follows:

>>> from Bio import SeqIO

Bold: Indicates a new term, an important word, or words that you see onscreen. For instance, words in menus or dialog boxes appear in bold. Here is an example: In the Create the default IAM role pop-up window, select Any S3 bucket.

Tips or important notes

Appear like this.

Get in touch

Feedback from our readers is always welcome.

General feedback: If you have questions about any aspect of this book, email us at [email protected] and mention the book title in the subject of your message.

Errata: Although we have taken every care to ensure the accuracy of our content, mistakes do happen. If you have found a mistake in this book, we would be grateful if you would report this to us. Please visit www.packtpub.com/support/errata and fill in the form.

Piracy: If you come across any illegal copies of our works in any form on the internet, we would be grateful if you would provide us with the location address or website name. Please contact us at [email protected] with a link to the material.

If you are interested in becoming an author: If there is a topic that you have expertise in and you are interested in either writing or contributing to a book, please visit authors.packtpub.com.

Reviews

Please leave a review. Once you have read and used this book, why not leave a review on the site that you purchased it from? Potential readers can then see and use your unbiased opinion to make purchase decisions, we at Packt can understand what you think about our products, and our authors can see your feedback on their book. Thank you!

For more information about Packt, please visit packt.com.

Share Your Thoughts

Once you’ve read Deep Learning for Genomics, we’d love to hear your thoughts! Please click here to go straight to the Amazon review page for this book and share your feedback.

Your review is important to us and the tech community and will help us make sure we’re delivering excellent quality content.

Download a free PDF copy of this book

Thanks for purchasing this book!

Do you like to read on the go but are unable to carry your print books everywhere?

Is your eBook purchase not compatible with the device of your choice?

Don’t worry, now with every Packt book you get a DRM-free PDF version of that book at no cost.

Read anywhere, any place, on any device. Search, copy, and paste code from your favorite technical books directly into your application.

The perks don’t stop there, you can get exclusive access to discounts, newsletters, and great free content in your inbox daily

Follow these simple steps to get the benefits:

Scan the QR code or visit the link below

https://packt.link/free-ebook/9781804615447

Submit your proof of purchase

That’s it! We’ll send your free PDF and other benefits to your email directly

Part 1 – Machine Learning in Genomics

This part will describe genomics data analysis and machine learning approaches to genomics. You will use state-of-the-art machine learning methods to transform raw genomics data into insights utilizing real-life examples in the life sciences and biotechnology industries.

This section comprises the following chapters:

Chapter 1, Introducing Machine Learning for Genomics

Chapter 2, Genomics Data Analysis

Chapter 3, Machine Learning Methods for Genomic Applications

1

Introducing Machine Learning for Genomics

Machine learning (ML) is the field of science that deals with developing computer algorithms and models that can perform certain tasks without explicitly programming them. This is to say, it teaches the machines to learn rather than specifying rules from input data provided to them. The machine then can convert that learning into expertise or knowledge and use that for predictions. ML is an important tool for leveraging technologies around artificial intelligence (AI), a subfield of computer science that aims to perform tasks automatically that we, as humans, are naturally good at. ML is an important aspect of all modern businesses and research. The adoption of ML for genomics applications is changing recently because of the availability of large genomic datasets, improvement in algorithms, and, most importantly, superior computational power. More and more scientific research organizations and industries are expanding the use of ML across vast volumes of genomic data for predictive diagnostics, as well as to get biological insights at the scale of population health.

Genomics, the study of the genetic constitution of organisms, holds promise in understanding and diagnosing human diseases or improving our agriculture and livestock. The field of genomics has seen exponential growth in the last 15 years, mainly due to recent technological advances in High-throughput sequencing also known as next-generation sequencing (NGS) technologies generating exponential amounts of genomics data. It is estimated that between 100 million and as many as 2 billion human genomes could be sequenced by 2025 (https://journals.plos.org/plosbiology/article?id=10.1371/journal.pbio.1002195), representing an astounding growth of four to five orders of magnitude in 10 years and far exceeding the growth of many big data domains. This complexity and the sheer amount of data generated create roadblocks not only to the acquisition, storage, and distribution but also to genomic data analysis. The current tools used in the genomic analysis are built on top of deterministic approaches and rely on rules encoded to perform a particular task. To keep up with data growth, we need more and new innovative approaches, such as ML, in genomics to enrich our understanding of basic biology and subject them to applied research. In this chapter, we’ll learn what ML is, why ML is essential for genomics, and what value ML brings to life sciences and biotechnology industries that leverage genome data for the development of genomic-based products. By the end of this chapter, you will understand the limitations of the current conventional algorithms for genomic data analysis, how solving problems with ML is different from conventional approaches, and how ML approaches can fill in those gaps and make generating biological insights very easy.

As such, in this chapter, we’re going to cover the following main topics:

What is machine learning?

Why machine learning for genomics?

Machine learning for genomics in life sciences and biotechnology

What is machine learning?

Before we talk about ML, let’s understand what AI is. In the simplest terms, AI is the ability of a machine to mimic human intelligence and iteratively improve itself based on the information it collects. The goal of AI is to build systems to perform actions that are routinely done by humans such as problem-solving, pattern matching, image recognition, knowledge acquisition, and so on. ML, a subset of AI, is the process of training a model to learn and improve from experience. Deep learning (DL), in turn, is a subfield of ML, in which we leverage artificial neural networks (ANNs) to mimic the human brain and find the nonlinear relationships between the input and output to generate predictions (Figure 1.1):

Figure 1.1 – AI versus ML versus DL – how they are related

Figure 1.1 – AI versus ML versus DL – how they are related

In ML, a model is built based on input data and an underlying algorithm to make useful predictions from real-world data. In a simplified ML, features that represent an individual measurable property of the data are provided as input, and labels are returned as the predictions. Suppose we want to predict whether a particular sequence of DNA has a binding site for a transcription factor (TF) of your interest or not. Using the traditional approach, we would use a positional weight matrix (PWF) to scan the sequence and identify the potential motifs that are overrepresented. Even though this works, this is extremely difficult, manual, scalable, and so on. Using an

Enjoying the preview?

Page 1 of 1

Deep Learning for Genomics: Data-driven approaches for genomics applications in life sciences and biotechnology

About this ebook

Upendra Kumar Devisetty

Related authors

Related to Deep Learning for Genomics

Related ebooks

Interpretable Machine Learning with Python: Learn to build interpretable high-performance models with hands-on real-world examples

Active Machine Learning with Python: Refine and elevate data quality over quantity with active learning

Deep Learning with Python: A Comprehensive guide to Building and Training Deep Neural Networks using Python and popular Deep Learning Frameworks

The Deep Learning Architect's Handbook: Build and deploy production-ready DL solutions leveraging the latest Python techniques

Interpretable Machine Learning with Python: Build explainable, fair, and robust high-performance models with hands-on, real-world examples

Machine Learning Infrastructure and Best Practices for Software Engineers: Take your machine learning software from a prototype to a fully fledged software system

Practical Data Analysis: For small businesses, analyzing the information contained in their data using open source technology could be game-changing. All you need is some basic programming and mathematical skills to do just that.

Internet of Things (IoT) A Quick Start Guide: A to Z of IoT Essentials

Machine Learning for Imbalanced Data: Tackle imbalanced datasets using machine learning and deep learning techniques

Synthetic Data for Machine Learning: Revolutionize your approach to machine learning with this comprehensive conceptual guide

Advanced Deep Learning for Engineers and Scientists: A Practical Approach

Modern Time Series Forecasting with Python: Explore industry-ready time series forecasting using modern machine learning and deep learning

Harnessing the Power of AI: A Guide to Making Technology Work for You

Getting started with Deep Learning for Natural Language Processing: Learn how to build NLP applications with Deep Learning (English Edition)

Distributed Machine Learning with Python: Accelerating model training and serving with distributed systems

Knowledge-Based Bioinformatics: From Analysis to Interpretation

Deep Learning with PyTorch: A practical approach to building neural network models using PyTorch

The Handbook of NLP with Gensim: Leverage topic modeling to uncover hidden patterns, themes, and valuable insights within textual data

Neural Network Programming with Java

Reproducible Data Science with Pachyderm: Learn how to build version-controlled, end-to-end data pipelines using Pachyderm 2.0

Microservices Design Patterns in .NET: Making sense of microservices design and architecture using .NET Core

Machine Learning in Biotechnology and Life Sciences: Build machine learning models using Python and deploy them on the cloud

Hands-on Scikit-Learn for Machine Learning Applications: Data Science Fundamentals with Python

R Deep Learning Essentials.: A step-by-step guide to building deep learning models using TensorFlow, Keras, and MXNet

Metaprogramming with Python: A programmer's guide to writing reusable code to build smarter applications

Modern Computer Vision with PyTorch: A practical roadmap from deep learning fundamentals to advanced applications and Generative AI

Practical Guide to Applied Conformal Prediction in Python: Learn and apply the best uncertainty frameworks to your industry applications

Applied Deep Learning: Design and implement your own Neural Networks to solve real-world problems (English Edition)

Hands-On Genetic Algorithms with Python: Applying genetic algorithms to solve real-world deep learning and artificial intelligence problems

Machine Learning with Tensorflow: A Deeper Look at Machine Learning with TensorFlow

Intelligence (AI) & Semantics For You

Artificial Intelligence: A Guide for Thinking Humans

ChatGPT for Beginners: How to Make Money Online and 10x Your Productivity Using ChatGPT Even if You’re an Absolute Beginner (The Complete Up-to-Date ChatGPT Guide)

2084: Artificial Intelligence and the Future of Humanity

So You Want to Start a Podcast: Finding Your Voice, Telling Your Story, and Building a Community That Will Listen

Data Science from Scratch: The #1 Data Science Guide for Everything A Data Scientist Needs to Know: Python, Linear Algebra, Statistics, Coding, Applications, Neural Networks, and Decision Trees

ChatGPT Money Machine 2024 - The Ultimate Chatbot Cheat Sheet to Go From Clueless Noob to Prompt Prodigy Fast! Complete AI Beginner’s Course to Catch the GPT Gold Rush Before It Leaves You Behind

Mastering ChatGPT: 21 Prompts Templates for Effortless Writing

Summary of Super-Intelligence From Nick Bostrom

ChatGPT For Dummies

Midjourney Mastery - The Ultimate Handbook of Prompts

ChatGPT For Fiction Writing: AI for Authors

101 Midjourney Prompt Secrets

Creating Online Courses with ChatGPT | A Step-by-Step Guide with Prompt Templates

AI for Educators: AI for Educators

Python for Beginners. A Smarter Way to Learn Python in 5 Days and Remember it Longer. With Easy Step by Step Guidance and Hands on Examples. (Python Crash Course-Programming for Beginners)

ChatGPT Side Hustles 2024 - Unlock the Digital Goldmine and Get AI Working for You Fast with More Than 85 Side Hustle Ideas to Boost Passive Income, Create New Cash Flow, and Get Ahead of the Curve

Killer ChatGPT Prompts: Harness the Power of AI for Success and Profit

Three Story Method

The Secrets of ChatGPT Prompt Engineering for Non-Developers

Enterprise AI For Dummies

Rise of Generative AI and ChatGPT: Understand how Generative AI and ChatGPT are transforming and reshaping the business world (English Edition)

Neural Networks: A Practical Guide for Understanding and Programming Neural Networks and Useful Insights for Inspiring Reinvention

The Business Case for AI: A Leader's Guide to AI Strategies, Best Practices & Real-World Applications

Chat-GPT Income Ideas: Pioneering Monetization Concepts Utilizing Conversational AI for Profitable Ventures

The Algorithm of the Universe (A New Perspective to Cognitive AI)

Mastering ChatGPT: Create Highly Effective Prompts, Strategies, and Best Practices to Go From Novice to Expert

AI Crash Course: A fun and hands-on introduction to machine learning, reinforcement learning, deep learning, and artificial intelligence with Python

Dancing with Qubits: How quantum computing works and how it can change the world

The ChatGPT Handbook

Coding with AI For Dummies

Related podcast episodes

Related articles

Related categories

Reviews for Deep Learning for Genomics

What did you think?

Book preview

Deep Learning for Genomics - Upendra Kumar Devisetty

1