
Deep Learning for Medical Image Analysis
Ebook · 868 pages · 21 hours

About this ebook

Deep learning is providing exciting solutions for medical image analysis problems and is seen as a key method for future applications. This book gives a clear understanding of the principles and methods of neural network and deep learning concepts, showing how algorithms that integrate deep learning as a core component have been applied to medical image detection, segmentation, registration, and computer-aided analysis, across a wide variety of application areas.

Deep Learning for Medical Image Analysis is a great learning resource for academic and industry researchers in medical imaging analysis, and for graduate students taking courses on machine learning and deep learning for computer vision and medical image computing and analysis.
  • Covers common research problems in medical image analysis and their challenges
  • Describes deep learning methods and the theories behind approaches for medical image analysis
  • Teaches how algorithms are applied to a broad range of application areas, including chest X-ray, breast CAD, lung and chest imaging, microscopy, and pathology
  • Includes a Foreword written by Nicholas Ayache
Language: English
Release date: January 18, 2017
ISBN: 9780128104095

    Book preview

    Deep Learning for Medical Image Analysis - S. Kevin Zhou

    2016

    Part I

    Introduction

    Outline

    Chapter 1. An Introduction to Neural Networks and Deep Learning

    Chapter 2. An Introduction to Deep Convolutional Neural Nets for Computer Vision

    Chapter 1

    An Introduction to Neural Networks and Deep Learning

    Heung-Il Suk    Korea University, Seoul, Republic of Korea

    Abstract

    Artificial neural networks, conceptually and structurally inspired by neural systems, are of great interest along with deep learning, thanks to their great successes in various fields including medical imaging analysis. In this chapter, we describe the fundamental concepts and ideas of (deep) neural networks and explain algorithmic advances to learn network parameters efficiently by avoiding overfitting. Specifically, this chapter focuses on introducing (i) feed-forward neural networks, (ii) gradient descent-based parameter optimization algorithms, (iii) different types of deep models, (iv) technical tricks for fast and robust training of deep models, and (v) open source deep learning frameworks for quick practice.

    Keywords

    Neural networks; Convolutional neural network; Deep learning; Deep belief network; Deep Boltzmann machine

    Chapter Outline

    1.1  Introduction

    1.2  Feed-Forward Neural Networks

    1.2.1  Perceptron

    1.2.2  Multi-Layer Neural Network

    1.2.3  Learning in Feed-Forward Neural Networks

    1.3  Convolutional Neural Networks

    1.3.1  Convolution and Pooling Layer

    1.3.2  Computing Gradients

    1.4  Deep Models

    1.4.1  Vanishing Gradient Problem

    1.4.2  Deep Neural Networks

    1.4.2.1  Auto-Encoder

    1.4.2.2  Stacked Auto-Encoder

    1.4.3  Deep Generative Models

    1.4.3.1  Restricted Boltzmann Machine

    1.4.3.2  Deep Belief Network

    1.4.3.3  Deep Boltzmann Machine

    1.5  Tricks for Better Learning

    1.5.1  Rectified Linear Unit (ReLU)

    1.5.2  Dropout

    1.5.3  Batch Normalization

    1.6  Open-Source Tools for Deep Learning

    References

    1.1 Introduction

    The brain, or biological neural network, is considered the most well-organized system for processing information from the different senses such as sight, hearing, touch, taste, and smell in an efficient and intelligent manner. One of the key mechanisms for information processing in the human brain is that complicated high-level information is processed by means of the collaboration, i.e., connections (called synapses), of a large number of structurally simple elements (called neurons). In machine learning, artificial neural networks are a family of models that mimic the structural elegance of the neural system and learn patterns inherent in observations.

    1.2 Feed-Forward Neural Networks

    This section introduces neural networks that process information in a feed-forward manner. Throughout the chapter, matrices and vectors are denoted as boldface uppercase letters and boldface lowercase letters, respectively, and scalars are denoted as normal italic letters. For a transpose operator, a superscript ⊤ is used.

    1.2.1 Perceptron

    The simplest learnable artificial neural model, known as a perceptron, consists of input units x = [x_1, …, x_D]^⊤ and an output unit y (Fig. 1.1). It computes the value of the output unit y by taking the weighted sum of the inputs, followed by a nonlinear activation function f, as follows:

    (1.1)  y = f( ∑_{i=1}^{D} w_i x_i + b ) = f( w^⊤ x + b )

    where w = [w_1, …, w_D]^⊤ is a weight vector and b is a bias. Let us introduce a pre-activation variable z = w^⊤ x + b. As the activation function f, a 'logistic sigmoid', f(z) = 1 / (1 + exp(−z)), is commonly used for a binary classification task.

    Figure 1.1 An architecture of a single-layer neural network.

    When there are multiple output units y = [y_1, …, y_K]^⊤, each with its own weight vector w_k and bias b_k, the outputs are computed as follows:

    (1.2)  y_k = f( w_k^⊤ x + b_k ),  k = 1, …, K.

    As for the activation function, it is common to use a 'softmax', f(z_k) = exp(z_k) / ∑_{j=1}^{K} exp(z_j), for multi-class classification, where the output values can be interpreted as probabilities.
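    To make Eqs. (1.1) and (1.2) concrete, the following minimal NumPy sketch (not from the chapter; the sizes and variable names are illustrative) computes the output of a single-layer network with a logistic sigmoid for the binary case and a softmax for the multi-class case:

        import numpy as np

        def sigmoid(z):
            # logistic sigmoid f(z) = 1 / (1 + exp(-z))
            return 1.0 / (1.0 + np.exp(-z))

        def softmax(z):
            # numerically stable softmax over the output units
            e = np.exp(z - np.max(z))
            return e / e.sum()

        rng = np.random.default_rng(0)
        x = rng.normal(size=5)             # input vector, D = 5 (toy values)

        # binary case, Eq. (1.1): scalar pre-activation z = w^T x + b
        w, b = rng.normal(size=5), 0.1
        y_binary = sigmoid(w @ x + b)      # probability of the positive class

        # multi-class case, Eq. (1.2): one weight vector and bias per output unit (K = 3)
        W, b_vec = rng.normal(size=(3, 5)), np.zeros(3)
        y_multi = softmax(W @ x + b_vec)   # class probabilities, sum to 1
        print(y_binary, y_multi)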

    1.2.2 Multi-Layer Neural Network

    One of the main limitations of the single-layer neural network is that it can only produce a linear separation for a classification task, despite the use of a nonlinear activation function. This limitation can be circumvented by introducing a so-called 'hidden' layer between the input layer and the output layer, as shown in Fig. 1.2, where the double circles denote a vectorization of the units in the respective layers for simplification. Note that in Fig. 1.2 there can exist multiple units in each layer, and the units of neighboring layers are fully connected to each other, but there are no connections within the same layer. For a two-layer neural network, which is also known as a multi-layer perceptron, we can write its composition function as follows:

    (1.3)  y = f^{(2)}( W^{(2)} f^{(1)}( W^{(1)} x + b^{(1)} ) + b^{(2)} )

    where the superscript denotes a layer index, W^{(1)} ∈ R^{M×D} and W^{(2)} ∈ R^{K×M} are weight matrices (M is the number of hidden units), and the corresponding estimation function of an L-layer network is defined as

    (1.4)  y( x; Θ ) = f^{(L)}( W^{(L)} f^{(L−1)}( ⋯ f^{(1)}( W^{(1)} x + b^{(1)} ) ⋯ ) + b^{(L)} )

    where Θ = { W^{(l)}, b^{(l)} }_{l=1}^{L} denotes the set of all network parameters.

    Figure 1.2 An architecture of a two-layer neural network in simplified representation.

    The hidden-layer activation function f^{(1)} is commonly defined with a sigmoidal function such as a 'logistic sigmoid', f(z) = 1 / (1 + exp(−z)), or a 'hyperbolic tangent', f(z) = tanh(z).
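    To illustrate the composition in Eqs. (1.3) and (1.4), here is a minimal NumPy sketch of a two-layer forward pass (hyperbolic tangent hidden layer, softmax output); the layer sizes and names are illustrative only:

        import numpy as np

        def softmax(z):
            e = np.exp(z - np.max(z))
            return e / e.sum()

        def mlp_forward(x, W1, b1, W2, b2):
            # hidden layer: a1 = f1(W1 x + b1), here f1 = tanh
            a1 = np.tanh(W1 @ x + b1)
            # output layer: y = f2(W2 a1 + b2), here f2 = softmax
            y = softmax(W2 @ a1 + b2)
            return a1, y

        rng = np.random.default_rng(0)
        D, M, K = 4, 8, 3                  # input, hidden, and output sizes (toy)
        W1, b1 = rng.normal(size=(M, D)), np.zeros(M)
        W2, b2 = rng.normal(size=(K, M)), np.zeros(K)
        _, y = mlp_forward(rng.normal(size=D), W1, b1, W2, b2)
        print(y.sum())                     # ~1.0, a valid probability vector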

    1.2.3 Learning in Feed-Forward Neural Networks

    In terms of network learning, there are two fundamental problems, namely, learning the network architecture and learning the network parameters. While network architecture learning still remains an open question,² there exists an efficient algorithm for network parameter learning, as detailed below.

    Let { ( x_n, t_n ) }_{n=1}^{N} be a training set, where t_n denotes a class indicator vector with one-of-K encoding, i.e., for a class k, only the kth element is 1 and all the other elements are 0. For K-class classification, it is common to use a cross-entropy cost function defined as follows:

    (1.5)  E( Θ ) = − ∑_{n=1}^{N} ∑_{k=1}^{K} t_{nk} ln y_k( x_n; Θ )

    where t_{nk} denotes the kth element of t_n and y_k( x_n; Θ ) is the kth output from an L-layer neural network, which can be obtained by Eq. (1.4).

    The error function in Eq. (1.5) is minimized by gradient descent, which requires the gradient of the error function evaluated at the parameter set Θ.

    For a feed-forward neural network, the gradient can be efficiently evaluated by means of error backpropagation [2]. The key idea of the backpropagation algorithm is to propagate errors from the output layer back to the input layer by a chain rule. Specifically, in an L-layer neural network, the derivative of an error function E with respect to the parameters W^{(l)} of the layer l can be estimated as follows:

    (1.6)  ∂E/∂W^{(l)} = ( ∂E/∂z^{(l)} ) ( ∂z^{(l)}/∂W^{(l)} ) = δ^{(l)} ( a^{(l−1)} )^⊤

    where z^{(l)} and a^{(l)} respectively denote the pre-activation vector and the activation vector of the layer l, δ^{(l)} = ∂E/∂z^{(l)} is the error message of the layer l, and δ^{(L)} corresponds to the error computed at the output layer. For the hidden layers, the error message and the gradient of the error function E with respect to the bias b^{(l)} can also be computed in a similar way as

    (1.7)  δ^{(l)} = ( ( W^{(l+1)} )^⊤ δ^{(l+1)} ) ⊙ f′( z^{(l)} )

    (1.8)  ∂E/∂b^{(l)} = δ^{(l)}

    where ⊙ denotes an element-wise multiplication and f′(·) is the derivative of the activation function.
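    As a worked sketch of Eqs. (1.6)-(1.8) for a toy two-layer network (illustrative code, not the chapter's): the output error is propagated back through the weights and the activation derivative, and the weight gradients are outer products with the previous layer's activations.

        import numpy as np

        def backprop_two_layer(x, t, W1, b1, W2, b2):
            # forward pass: tanh hidden layer, softmax output
            z1 = W1 @ x + b1
            a1 = np.tanh(z1)
            z2 = W2 @ a1 + b2
            y = np.exp(z2 - z2.max()); y /= y.sum()

            # output-layer error message; for softmax + cross-entropy it is y - t
            delta2 = y - t
            # Eq. (1.6): gradient w.r.t. W2 is the outer product of delta2 and a1
            dW2, db2 = np.outer(delta2, a1), delta2
            # Eq. (1.7): propagate back through W2 and multiply by f'(z1) = 1 - tanh^2(z1)
            delta1 = (W2.T @ delta2) * (1.0 - np.tanh(z1) ** 2)
            # Eqs. (1.6) and (1.8) applied to the first layer
            dW1, db1 = np.outer(delta1, x), delta1
            return dW1, db1, dW2, db2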

    Fig. 1.3 illustrates a graphical representation of estimating the gradients of an error function with respect to the parameters in an L-layer network; error messages are passed through arrows from tail (source) to head (destination), starting from the rightmost double circle node. When a double circle node of the layer l receives an error message from the layer l + 1, it updates its own error message in two steps as

    (1.9)  δ̄^{(l)} = ( W^{(l+1)} )^⊤ δ^{(l+1)}

    (1.10)  δ^{(l)} = δ̄^{(l)} ⊙ f′( z^{(l)} )

    where δ̄^{(l)} denotes the message received through the connection weights. After updating the error message, the double circle node of the layer l passes it on to the layer l − 1. It should be noted that the double circle nodes can send their message to the neighboring nodes only once they receive an error message from their source node.

    Figure 1.3 Graphical representation of a backpropagation algorithm in an L-layer network.

    Once we obtain the gradient vectors of all the layers, i.e., the shaded nodes at the bottom of the graph in Fig. 1.3, the parameter set Θ is updated as follows:

    (1.11)  Θ^{(τ+1)} = Θ^{(τ)} − η ∇E( Θ^{(τ)} )

    where ∇E( Θ^{(τ)} ) is obtained via backpropagation, η is a learning rate, and τ denotes an iteration index. The update process repeats until convergence or until the predefined number of iterations is reached.

    As for the parameter update in Eq. (1.11), there are two different approaches depending on the timing of the parameter update, namely, batch gradient descent and stochastic gradient descent. Batch gradient descent updates the parameters based on the gradients ∇E evaluated over the whole training set. Meanwhile, stochastic gradient descent sequentially updates the weight parameters by computing the gradient on the basis of one sample at a time. When it comes to large-scale learning such as deep learning, it is advocated to apply stochastic gradient descent [3]. As a trade-off between batch gradient and stochastic gradient descent, a mini-batch gradient descent method, which computes and updates the parameters on the basis of a small set of samples, is commonly used in the literature [4].
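    The following sketch shows the common core of these schemes, a mini-batch stochastic gradient descent loop applying Eq. (1.11); the parameter dictionary, the gradient function, and the toy least-squares usage are illustrative assumptions rather than the chapter's code:

        import numpy as np

        def minibatch_sgd(params, grad_fn, X, T, lr=0.1, batch_size=32, epochs=10, seed=0):
            """Update `params` in place with mini-batch SGD: theta <- theta - lr * gradient."""
            rng = np.random.default_rng(seed)
            n = X.shape[0]
            for _ in range(epochs):
                order = rng.permutation(n)                 # reshuffle the samples every epoch
                for start in range(0, n, batch_size):
                    idx = order[start:start + batch_size]  # one mini-batch of samples
                    grads = grad_fn(params, X[idx], T[idx])
                    for key in params:                     # Eq. (1.11), applied per parameter
                        params[key] -= lr * grads[key]
            return params

        # toy usage: fit T = X w with a squared-error gradient
        rng = np.random.default_rng(1)
        X = rng.normal(size=(256, 3)); T = X @ np.array([1.0, -2.0, 0.5])
        def grad_fn(p, Xb, Tb):
            err = Xb @ p["w"] - Tb
            return {"w": 2 * Xb.T @ err / len(Xb)}
        print(minibatch_sgd({"w": np.zeros(3)}, grad_fn, X, T)["w"])   # approaches [1, -2, 0.5]

    Setting batch_size to the training-set size recovers batch gradient descent, while batch_size = 1 recovers pure stochastic gradient descent.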

    1.3 Convolutional Neural Networks

    In conventional multi-layer neural networks, the inputs are always in vector form. However, for (medical) images, the structural or configural information among neighboring pixels or voxels is another source of information, and vectorization inevitably destroys it. A Convolutional Neural Network (CNN) is designed to better utilize such spatial and configural information by taking 2D or 3D images as input; it typically has convolutional layers interspersed with pooling (or sub-sampling) layers, followed by fully connected layers as in a standard multi-layer neural network (Fig. 1.4). Unlike conventional multi-layer neural networks, a CNN exploits extensive weight sharing to reduce the degrees of freedom of the model. A pooling layer helps reduce computation time and gradually builds up spatial and configural invariance.

    Figure 1.4 An architecture of a convolutional neural network.

    1.3.1 Convolution and Pooling Layer

    Each convolution layer l is parameterized by a set of learnable kernels k_{ij}^{(l)}, i.e., connection weights between the feature map i of the layer l − 1 and the feature map j at the layer l. Specifically, the units of the jth feature map of the convolution layer l are computed as follows:

    (1.12)  a_j^{(l)} = f( ∑_i a_i^{(l−1)} ∗ k_{ij}^{(l)} + b_j^{(l)} )

    where ∗ denotes a convolution operation, b_j^{(l)} is a bias, and f(·) is a nonlinear activation function. Due to the local connectivity and weight sharing, we can greatly reduce the number of parameters compared to a fully connected neural network, and thus it is possible to avoid overfitting. Further, when the input image is shifted, the activations of the units in the feature maps are shifted by the same amount, which allows a CNN to be equivariant to small shifts, as illustrated in Fig. 1.5. In the figure, when the pixel values in the input image are shifted by one pixel right and one pixel down, the outputs after convolution are also shifted by one pixel right and one pixel down.

    Figure 1.5 Illustration of translation invariance in a convolutional neural network. The bottom leftmost input is a version of the upper leftmost input image translated by one pixel right and one pixel down.

    A pooling layer follows a convolution layer to downsample the feature maps of the preceding convolution layer. Specifically, each feature map in a pooling layer is linked with a feature map in the convolution layer, and each unit in a feature map of the pooling layer is computed based on a subset of units within its receptive field. Similar to the convolution layer, the pooling operator (e.g., a max operator that finds the maximal value among the units in its receptive field) slides over the convolution map, but commonly with a stride equal to the size of the receptive field so that contiguous receptive fields do not overlap. The role of the pooling layer is to progressively reduce the spatial size of the feature maps, and thus reduce the number of parameters and the computation involved in the network. Another important function of the pooling layer is to provide translation invariance over small spatial shifts in the input. In Fig. 1.5, while the bottom leftmost image is a translated version of the top leftmost image by one pixel right and one pixel down, their outputs after convolution and pooling operations are the same, especially for the units in green.
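    The sketch below (pure NumPy loops, single channel, illustrative sizes) implements a 'valid' convolution as in Eq. (1.12), written as cross-correlation as is common in practice, followed by non-overlapping 2 × 2 max-pooling:

        import numpy as np

        def conv2d_valid(image, kernel, bias=0.0):
            # 'valid' 2D convolution (cross-correlation) of one image with one kernel
            H, W = image.shape
            kh, kw = kernel.shape
            out = np.zeros((H - kh + 1, W - kw + 1))
            for r in range(out.shape[0]):
                for c in range(out.shape[1]):
                    out[r, c] = np.sum(image[r:r + kh, c:c + kw] * kernel) + bias
            return np.tanh(out)                # tanh as the nonlinear activation f

        def max_pool(feature_map, size=2):
            # non-overlapping max-pooling: stride equals the receptive-field size
            H, W = feature_map.shape
            H2, W2 = H // size, W // size
            blocks = feature_map[:H2 * size, :W2 * size].reshape(H2, size, W2, size)
            return blocks.max(axis=(1, 3))

        rng = np.random.default_rng(0)
        img = rng.normal(size=(8, 8))          # toy 8 x 8 input image
        fmap = conv2d_valid(img, rng.normal(size=(3, 3)))   # 6 x 6 feature map
        print(max_pool(fmap).shape)            # (3, 3)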

    1.3.2 Computing Gradients

    Assume that a convolution layer is followed by a pooling layer. In such a case, each unit in a feature map of the pooling layer l + 1 corresponds to a block of units in the associated feature map of the convolution layer l. Hence, the error message of the jth feature map of the convolution layer l can be computed by up-sampling the error message of the corresponding feature map of the pooling layer as follows:

    (1.13)  δ_j^{(l)} = up( δ_j^{(l+1)} ) ⊙ f′( z_j^{(l)} )

    where up(·) denotes an up-sampling operation.

    For the case when a current layer, whether it is a pooling layer or a convolution layer, is followed by a convolution layer, we must figure out which patch in the current layer's feature map corresponds to each unit in the next layer's feature map. The weights multiplying the connections between the input patch and the output unit are exactly the weights of the convolutional kernel. The gradients for the kernel weights are computed by the chain rule, similar to backpropagation. However, since the same weights are now shared across many connections, we need to sum the gradients for a given weight over all the connections that use it as follows:

    (1.14)  ∂E/∂k_{ij}^{(l)} = ∑_{u,v} ( δ_j^{(l)} )_{uv} ( p_i^{(l−1)} )_{uv}

    where ( p_i^{(l−1)} )_{uv} denotes the patch in the ith feature map of the layer l − 1 that was multiplied element-wise by the kernel k_{ij}^{(l)} to compute the unit at position (u, v) of the jth feature map of the layer l.
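    As a toy sketch of the weight-sharing sum in Eq. (1.14) (single channel, hypothetical names): the gradient of a shared kernel weight accumulates, over every output position, the error at that position times the corresponding input patch.

        import numpy as np

        def conv_kernel_grad(input_map, delta, kernel_shape):
            """Accumulate dE/dk by summing delta[u, v] * patch(u, v) over all output units."""
            kh, kw = kernel_shape
            grad = np.zeros(kernel_shape)
            for u in range(delta.shape[0]):
                for v in range(delta.shape[1]):
                    patch = input_map[u:u + kh, v:v + kw]   # patch that produced output (u, v)
                    grad += delta[u, v] * patch             # shared weights: sum the contributions
            return grad

        rng = np.random.default_rng(0)
        x = rng.normal(size=(8, 8))        # feature map of the layer l-1
        delta = rng.normal(size=(6, 6))    # error message of the 6 x 6 output feature map
        print(conv_kernel_grad(x, delta, (3, 3)).shape)     # (3, 3), same shape as the kernel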

    1.4 Deep Models

    Under a mild assumption on the activation function, a two-layer neural network with a finite number of hidden units can approximate any continuous function [5]; thus, it is regarded as a universal approximator. However, it is also possible to approximate complex functions to the same accuracy using a 'deep' architecture (i.e., multiple layers) with a much smaller total number of neurons. Hence, we can reduce the number of weight parameters, which allows training with a relatively small-sized dataset [6]. More importantly, compared to a shallow architecture, which needs a 'good' feature extractor, mostly designed by engineering skill or domain expert knowledge, deep models are good at discovering good features automatically in a hierarchical manner, i.e., from fine to abstract [7].

    1.4.1 Vanishing Gradient Problem

    While there have been earlier attempts to deploy deep neural networks, the lack of training samples and the limited computational power prohibited them from successful use for applications. Recently, thanks to the huge amount of data available online and the advances in hardware such as multi-core Central Processing Units (CPUs) and Graphics Processing Units (GPUs), those difficulties have been resolved.

    However, a so-called 'vanishing gradient' problem remains: the error messages propagated backward as in Fig. 1.3 become ineffective after repeated multiplications. In this regard, the gradients tend to get smaller and smaller as they propagate backward through the hidden layers, and units in the lower layers (close to the input layer) learn much more slowly than units in the higher layers (close to the output layer).

    Regarding the vanishing gradient, Glorot and Bengio [8] pointed out that the use of the logistic sigmoid activation function caused the derivatives of units with respect to the connection weight parameters to saturate near 0 early in training, and hence substantially slowed down the learning speed. As a potential remedy to this problem, they suggested alternative activation functions such as a hyperbolic tangent function and a soft sign function, along with suitable ways to initialize the weights [8]. In the meantime, Hinton and Salakhutdinov devised a greedy layer-wise pretraining technique [9] that provided a breakthrough of sorts for the vanishing gradient problem. From a learning perspective, in conventional network training we randomly initialize the connection weights and then learn all the weight parameters jointly in a supervised manner by means of gradient-descent methods, with the backpropagation algorithm used for gradient computation. The idea of pretraining is that, instead of directly adjusting all the weight parameters from random initial values, it is beneficial to first train the connection weights of the hidden layers in an unsupervised and layer-wise fashion, and then fine-tune the connection weights jointly. This pretraining technique is described in further detail below in the context of the stacked auto-encoder.

    1.4.2 Deep Neural Networks

    Here, we introduce a deep neural network that constructs a deep architecture by taking auto-encoders as building blocks for a hierarchical feature representation.

    1.4.2.1 Auto-Encoder

    An auto-encoder, also called an auto-associator, is a special type of two-layer neural network composed of an input layer, a hidden layer, and an output layer. The input layer is fully connected to the hidden layer, which is in turn fully connected to the output layer, as illustrated in Fig. 1.6A. The aim of an auto-encoder is to learn a latent or compressed representation of an input by minimizing the reconstruction error between the input and the values reconstructed from the learned representation.

    Figure 1.6 A graphical illustration of an auto-encoder and a stacked auto-encoder.

    An auto-encoder first maps an input vector x to a hidden representation a through a linear mapping and then a nonlinear transformation with a nonlinear activation function f as follows:

    (1.15)  a = f( W x + b )

    where W is an encoding weight matrix and b is a bias vector. The representation a is then mapped back to an output vector y, which approximately reconstructs the input vector x by another mapping as follows:

    (1.16)  y = f( W′ a + b′ )

    where W′ and b′ are a decoding weight matrix and a bias vector, respectively. Structurally, the number of input units and the number of output units are determined by the dimension of an input vector. Meanwhile, the number of hidden units can be determined based on the nature of the data. If the number of hidden units is less than the dimension of the input data, then the auto-encoder can be used for dimensionality reduction. However, it is worth noting that to capture complicated nonlinear relations among input features, it is possible to allow the number of hidden units to be even larger than the input dimension, in which case we can still find an interesting structure by imposing a sparsity constraint [10,11].

    From a learning perspective, the goal of an auto-encoder is to minimize the reconstruction error between the input x and the output y over the training samples. To encourage sparse representations, it is common to add a penalty defined by the Kullback–Leibler (KL) divergence between the average activation ρ̂_j of the jth hidden unit over the training samples and a target average activation ρ, defined as [12]:

    (1.17)  KL( ρ ∥ ρ̂_j ) = ρ ln( ρ / ρ̂_j ) + ( 1 − ρ ) ln( ( 1 − ρ ) / ( 1 − ρ̂_j ) )

    Then our objective function can be written as:

    (1.18)  E( Θ ) = ∑_{n=1}^{N} ∥ x_n − y_n ∥² + γ ∑_j KL( ρ ∥ ρ̂_j )

    where γ denotes a sparsity control parameter. With the introduction of the KL divergence, the error function penalizes a large average activation of a hidden unit over the training samples when ρ is set to be small. This penalization drives the activations of many hidden units to be equal or close to zero, yielding sparse representations.
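    A minimal sketch of Eqs. (1.15)-(1.18) with toy sizes (the variable names and hyperparameters are illustrative, not the chapter's): encode, decode, and evaluate the reconstruction error plus the KL sparsity penalty.

        import numpy as np

        def sigmoid(z):
            return 1.0 / (1.0 + np.exp(-z))

        def sparse_ae_objective(X, W, b, W_dec, b_dec, rho=0.05, gamma=1.0):
            A = sigmoid(X @ W.T + b)                  # Eq. (1.15): hidden activations, shape (N, F)
            Y = sigmoid(A @ W_dec.T + b_dec)          # Eq. (1.16): reconstructions, shape (N, D)
            recon = np.sum((X - Y) ** 2)              # summed squared reconstruction error
            rho_hat = A.mean(axis=0)                  # average activation of each hidden unit
            kl = np.sum(rho * np.log(rho / rho_hat)
                        + (1 - rho) * np.log((1 - rho) / (1 - rho_hat)))   # Eq. (1.17)
            return recon + gamma * kl                 # Eq. (1.18)

        rng = np.random.default_rng(0)
        N, D, F = 100, 20, 10
        X = rng.uniform(size=(N, D))
        W, b = 0.1 * rng.normal(size=(F, D)), np.zeros(F)
        W_dec, b_dec = 0.1 * rng.normal(size=(D, F)), np.zeros(D)
        print(sparse_ae_objective(X, W, b, W_dec, b_dec))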

    1.4.2.2 Stacked Auto-Encoder

    Note that the outputs of the units in the hidden layer become the latent representation of the input vector. However, due to its shallow structure, the representational power of a single auto-encoder is known to be very limited. By stacking multiple auto-encoders, taking the activation values of the hidden units of one auto-encoder as the input to the following upper auto-encoder, and thus building a Stacked Auto-Encoder (SAE), it is possible to greatly improve the representational power [13]. Thanks to its hierarchical structure, one of the most important characteristics of the SAE is its ability to learn or discover highly nonlinear and complicated patterns, such as the relations among input features. When an input vector is presented to an SAE, the different layers of the network represent different levels of information. That is, the lower the layer in the network, the simpler the patterns that are learned; the higher the layer, the more complicated or abstract the patterns inherent in the input feature vector.

    With regard to training the weight matrices and the biases in an SAE, a straightforward way is to apply backpropagation with a gradient-based optimization technique starting from random initialization, regarding the SAE as a conventional multi-layer neural network. Unfortunately, it is generally known that deep networks trained in this manner perform worse than networks with a shallow architecture, since they fall into poor local optima [11]. To circumvent this problem, it is good to consider greedy layer-wise learning [9]. The key idea of greedy layer-wise learning is to train one layer at a time, maximizing a variational lower bound. That is, we first train the first hidden layer with the training data as input, then train the second hidden layer with the outputs of the first hidden layer as input, and so on; the representation of the lth hidden layer is used as the input for training the (l + 1)th hidden layer. This greedy layer-wise learning is performed as 'pretraining' (Figs. 1.7A–1.7C). The important feature of pretraining is that it is conducted in an unsupervised manner with a standard backpropagation algorithm [14].

    Figure 1.7 Greedy layer-wise pretraining and fine-tuning of the whole network. (H_i denotes the ith hidden layer in the network.)

    When it comes to a classification problem, we stack another output layer on top of the SAE (Fig. 1.7D) with an appropriate activation function. This top output layer is used to represent the class label of an input sample. Then, by taking the pretrained connection weights as the initial parameters of the hidden layers and randomly initializing the connection weights between the top hidden layer and the output layer, it is possible to train all the parameters jointly in a supervised manner by gradient descent with a backpropagation algorithm. Note that the initialization of the parameters via pretraining helps the supervised optimization, called 'fine-tuning', reduce the risk of falling into poor local optima [9,11].
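    A compact, self-contained sketch of the greedy layer-wise scheme described above (the squared-error sigmoid auto-encoder, sizes, and learning rate are illustrative assumptions, not the chapter's implementation):

        import numpy as np

        def sigmoid(z):
            return 1.0 / (1.0 + np.exp(-z))

        def train_autoencoder(X, n_hidden, lr=0.1, epochs=50, seed=0):
            """Train one sigmoid auto-encoder with plain gradient descent on squared error."""
            rng = np.random.default_rng(seed)
            D = X.shape[1]
            W, b = 0.1 * rng.normal(size=(n_hidden, D)), np.zeros(n_hidden)
            Wd, bd = 0.1 * rng.normal(size=(D, n_hidden)), np.zeros(D)
            for _ in range(epochs):
                A = sigmoid(X @ W.T + b)            # encode, Eq. (1.15)
                Y = sigmoid(A @ Wd.T + bd)          # decode, Eq. (1.16)
                dY = 2 * (Y - X) * Y * (1 - Y)      # backprop through the decoder sigmoid
                dA = (dY @ Wd) * A * (1 - A)        # backprop through the encoder sigmoid
                Wd -= lr * dY.T @ A / len(X); bd -= lr * dY.mean(axis=0)
                W -= lr * dA.T @ X / len(X); b -= lr * dA.mean(axis=0)
            return W, b

        # Greedy layer-wise pretraining (Figs. 1.7A-1.7C): each hidden layer is trained
        # on the activations produced by the layers below it.
        rng = np.random.default_rng(0)
        data = rng.uniform(size=(200, 30))
        layer_sizes, inputs, stack = [20, 10], data, []
        for h in layer_sizes:
            W, b = train_autoencoder(inputs, h)
            stack.append((W, b))                    # keep the encoder weights as initialization
            inputs = sigmoid(inputs @ W.T + b)      # feed the activations to the next layer

        # For fine-tuning (Fig. 1.7D), `stack` would initialize the hidden layers of a
        # supervised network, with a randomly initialized output layer added on top.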

    1.4.3 Deep Generative Models

    Unlike deep neural networks, deep generative models can draw samples from the trained model, which allows checking qualitatively what has been learned, for instance, by visualizing images that the model generates. Here, we introduce two deep generative models, namely the Deep Belief Network (DBN) and the Deep Boltzmann Machine (DBM), which use Restricted Boltzmann Machines (RBMs) as basic building blocks for their construction.

    1.4.3.1 Restricted Boltzmann Machine

    An RBM is a two-layer undirected graphical model with visible and hidden units in its two layers (Fig. 1.8). It assumes symmetric connectivity W between the visible layer and the hidden layer, no connections within a layer, and a bias term for each layer, a and b, respectively. An RBM models the structures or dependencies over the visible units v ∈ {0, 1}^D by means of the hidden units h ∈ {0, 1}^F, where D and F denote the numbers of visible and hidden units, respectively. Note that because of the symmetry of the weight matrix W, it is possible to reconstruct the input observations from the hidden representations. Therefore, an RBM is naturally regarded as an auto-encoder [9], and this favorable characteristic is used in learning the RBM parameters [9].

    Figure 1.8 An architecture of a restricted Boltzmann machine (A) and its simplified illustration (B).

    The joint probability of the visible units v and the hidden units h is given by

    (1.19)  P( v, h; Θ ) = ( 1 / Z(Θ) ) exp( −E( v, h; Θ ) )

    where Z(Θ) is a partition function that can be obtained by summing over all possible pairs of v and h, and the energy function E( v, h; Θ ) is defined as

    (1.20)  E( v, h; Θ ) = −a^⊤ v − b^⊤ h − v^⊤ W h

    The conditional distribution of the hidden units given the visible units and also the conditional distribution of the visible units given the hidden units are respectively computed as

    (1.21)  P( h_j = 1 | v ) = σ( b_j + ∑_i W_{ij} v_i )

    (1.22)  P( v_i = 1 | h ) = σ( a_i + ∑_j W_{ij} h_j )

    where σ(·) is a logistic sigmoid function. Due to the unobservable hidden units, the objective function is defined as the marginal distribution of the visible units as

    (1.23)  P( v; Θ ) = ( 1 / Z(Θ) ) ∑_h exp( −E( v, h; Θ ) )

    The formulation above assumes binary visible units; when the visible units are real-valued, it is common to use a Gaussian RBM [9], in which the energy function is given by

    (1.24)  E( v, h; Θ ) = ∑_i ( v_i − a_i )² / ( 2σ_i² ) − ∑_j b_j h_j − ∑_{i,j} ( v_i / σ_i ) W_{ij} h_j

    where σ_i denotes a standard deviation of the ith visible unit. This variation leads to the following conditional distribution of the visible units given the binary hidden units

    (1.25)  P( v_i = v | h ) = N( v; a_i + σ_i ∑_j W_{ij} h_j, σ_i² )

    The RBM parameters are usually trained using a contrastive divergence algorithm [15] that maximizes the log-likelihood of the observations. When taking the derivative of the log-likelihood with respect to the parameters Θ, we obtain

    (1.26)  ∂ ln P( v; Θ ) / ∂W_{ij} = ⟨ v_i h_j ⟩_data − ⟨ v_i h_j ⟩_model

    where ⟨·⟩_data and ⟨·⟩_model denote expectations with respect to the data distribution and the model distribution, respectively. In contrastive divergence, the model expectations are approximated by means of a Gibbs sampling technique [16] and their difference provides the direction of updating the parameters in (stochastic) gradient descent as follows:

    (1.27)  ΔW = α ( ⟨ v h^⊤ ⟩_data − ⟨ v h^⊤ ⟩_recon )

    (1.28)  Δa = α ( ⟨ v ⟩_data − ⟨ v ⟩_recon )

    (1.29)  Δb = α ( ⟨ h ⟩_data − ⟨ h ⟩_recon )

    where α denotes the learning rate.
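    A minimal sketch of one-step contrastive divergence (CD-1) for a binary RBM, following Eqs. (1.21), (1.22), and (1.27)-(1.29); the sizes and hyperparameters are illustrative:

        import numpy as np

        def sigmoid(z):
            return 1.0 / (1.0 + np.exp(-z))

        def cd1_update(V, W, a, b, alpha=0.05, rng=np.random.default_rng(0)):
            """One CD-1 parameter update for a binary RBM on a batch of visible vectors V."""
            ph_data = sigmoid(V @ W + b)                 # positive phase, Eq. (1.21): P(h=1 | v)
            h_sample = (rng.uniform(size=ph_data.shape) < ph_data).astype(float)
            pv_recon = sigmoid(h_sample @ W.T + a)       # one Gibbs step, Eq. (1.22): P(v=1 | h)
            ph_recon = sigmoid(pv_recon @ W + b)
            n = len(V)
            # Eqs. (1.27)-(1.29): data statistics minus reconstruction statistics
            W += alpha * (V.T @ ph_data - pv_recon.T @ ph_recon) / n
            a += alpha * (V - pv_recon).mean(axis=0)
            b += alpha * (ph_data - ph_recon).mean(axis=0)
            return W, a, b

        rng = np.random.default_rng(1)
        V = (rng.uniform(size=(64, 20)) < 0.3).astype(float)   # toy binary data, D = 20
        W, a, b = 0.01 * rng.normal(size=(20, 10)), np.zeros(20), np.zeros(10)
        for _ in range(100):
            W, a, b = cd1_update(V, W, a, b)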

    1.4.3.2 Deep Belief Network

    Since an RBM is a kind of auto-encoder, it is straightforward to stack multiple RBMs for deep architecture construction, similar to the SAE, which results in a single probabilistic model called a Deep Belief Network (DBN). That is, a DBN has one visible layer v and a series of hidden layers h^{(1)}, …, h^{(L)} as shown in Fig. 1.9A, where { W^{(1)}, …, W^{(L)} } denote the corresponding RBM parameters. Note that while the top two layers still form an undirected generative model, i.e., an RBM, the lower layers form directed generative models. Hence, the joint distribution of the observed units v and the L hidden layers h^{(l)} ( l = 1, …, L ) in a DBN is given as

    (1.30)  P( v, h^{(1)}, …, h^{(L)} ) = P( v | h^{(1)} ) ( ∏_{l=1}^{L−2} P( h^{(l)} | h^{(l+1)} ) ) P( h^{(L−1)}, h^{(L)} )

    where P( h^{(l)} | h^{(l+1)} ) corresponds to a conditional distribution for the units of the layer l given the units of the layer l + 1, and P( h^{(L−1)}, h^{(L)} ) is the joint distribution of the top two layers, i.e., the RBM.

    Figure 1.9 Graphical illustrations of two different deep generative models with L hidden layers.

    As for the parameter learning, the pretraining scheme described in Section 1.4.2.2 can also be applied as follows:

    1. Train the first layer as an RBM that models the raw input v = h^{(0)} as its visible layer.

    2. Use the first layer to obtain a representation of the input that will be used as data for the second layer, e.g., the mean activations P( h^{(1)} = 1 | h^{(0)} ) or samples drawn from P( h^{(1)} | h^{(0)} ).

    3. Train the second layer as an RBM, taking the transformed data (samples or mean activations) as training examples (for the visible layer of that RBM).

    4. Iterate steps 2 and 3 for the desired number of layers, each time propagating upward either samples or mean activations.

    This greedy layer-wise training of the DBN can be justified as increasing a variational lower bound on the log-likelihood of the data [9]. After the greedy layer-wise procedure is completed, it is possible to perform generative fine-tuning using the wake-sleep algorithm [17], although in practice no further procedure is usually carried out to train the whole DBN jointly. In order to use a DBN for classification, the trained DBN can be directly used to initialize a deep neural network with the trained weights and biases; the deep neural network is then fine-tuned by means of backpropagation and (stochastic) gradient descent.
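    To make steps 1-4 concrete, here is a self-contained toy sketch (CD-1 trained RBMs with illustrative sizes and hyperparameters, not the chapter's code) that stacks two RBMs into a DBN by propagating mean activations upward:

        import numpy as np

        def sigmoid(z):
            return 1.0 / (1.0 + np.exp(-z))

        def train_rbm(V, n_hidden, alpha=0.05, epochs=100, seed=0):
            """Train a binary RBM with CD-1 and return its weights and hidden biases."""
            rng = np.random.default_rng(seed)
            D = V.shape[1]
            W, a, b = 0.01 * rng.normal(size=(D, n_hidden)), np.zeros(D), np.zeros(n_hidden)
            for _ in range(epochs):
                ph = sigmoid(V @ W + b)                   # P(h=1 | v)
                h = (rng.uniform(size=ph.shape) < ph).astype(float)
                pv = sigmoid(h @ W.T + a)                 # reconstruction, P(v=1 | h)
                ph2 = sigmoid(pv @ W + b)
                W += alpha * (V.T @ ph - pv.T @ ph2) / len(V)
                a += alpha * (V - pv).mean(axis=0)
                b += alpha * (ph - ph2).mean(axis=0)
            return W, b

        # Greedy layer-wise stacking (steps 1-4): each RBM is trained on the mean
        # activations produced by the RBM below it.
        rng = np.random.default_rng(2)
        data = (rng.uniform(size=(128, 30)) < 0.3).astype(float)
        dbn, inputs = [], data
        for n_hidden in [20, 10]:
            W, b = train_rbm(inputs, n_hidden)
            dbn.append((W, b))
            inputs = sigmoid(inputs @ W + b)              # propagate mean activations upward

        # For classification, the (W, b) pairs in `dbn` would initialize the hidden layers
        # of a feed-forward network that is then fine-tuned with backpropagation.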

    1.4.3.3 Deep Boltzmann Machine

    A DBM is also constructed by stacking multiple RBMs in a hierarchical manner. However, unlike the DBN, all the layers in the DBM still form an undirected generative model after stacking the RBMs, as illustrated in Fig. 1.9B. Thus, the units of a hidden layer l receive input from both the layer below and the layer above.

    The energy of the joint configuration ( v, h^{(1)}, …, h^{(L)} ) in the DBM is given by

    (1.31)  E( v, h^{(1)}, …, h^{(L)}; Θ ) = −v^⊤ W^{(1)} h^{(1)} − ∑_{l=2}^{L} ( h^{(l−1)} )^⊤ W^{(l)} h^{(l)}

    where Θ = { W^{(1)}, …, W^{(L)} } denotes the set of connectivity parameters (bias terms are omitted for clarity). Given the values of the units in the neighboring layer(s), the probability of a binary visible or binary hidden unit being set to 1 is computed as

    (1.32)  P( v_i = 1 | h^{(1)} ) = σ( ∑_j W_{ij}^{(1)} h_j^{(1)} )

    (1.33)  P( h_j^{(l)} = 1 | h^{(l−1)}, h^{(l+1)} ) = σ( ∑_i W_{ij}^{(l)} h_i^{(l−1)} + ∑_k W_{jk}^{(l+1)} h_k^{(l+1)} )

    (1.34)  P( h_k^{(L)} = 1 | h^{(L−1)} ) = σ( ∑_j W_{jk}^{(L)} h_j^{(L−1)} )

    Note that when computing the conditional probability of the units in the first hidden layer, we incorporate both the lower visible layer v and the upper hidden layer; this makes the DBM different from the DBN and also more robust to noisy observations [18,19].

    For a classification task, it is possible to use a DBM by replacing the RBM at the top hidden layer with a discriminative RBM [20], which can also be applied to a DBN. That is, the top hidden layer is now connected to both the lower hidden layer and an additional label layer o, which indicates the label of the input v. The energy of the joint configuration in the discriminative DBM is given by

    (1.35)  E( v, h^{(1)}, …, h^{(L)}, o; Θ ) = −v^⊤ W^{(1)} h^{(1)} − ∑_{l=2}^{L} ( h^{(l−1)} )^⊤ W^{(l)} h^{(l)} − ( h^{(L)} )^⊤ U o

    where U and o respectively denote a connectivity matrix between the top hidden layer and the label layer and a class-label indicator vector, and the probability of each of the C classes is computed
