BORIS Portal

Bern Open Repository and Information System

A reinforcement learning approach for VQA validation: An application to diabetic macular edema grading.

BORIS DOI
10.48350/182556
Publisher DOI
10.1016/j.media.2023.102822
PubMed ID
37182321
Description
Recent advances in machine learning models have greatly increased the performance of automated methods in medical image analysis. However, the internal functioning of such models is largely hidden, which hinders their integration into clinical practice. Explainability and trust are viewed as important prerequisites for the widespread use of modern methods in clinical communities. Validation of machine learning models is therefore important, and yet most methods are only validated in a limited way. In this work, we focus on providing a richer and more appropriate validation approach for highly powerful Visual Question Answering (VQA) algorithms. To better understand the performance of these methods, which answer arbitrary questions related to images, this work focuses on an automatic visual Turing test (VTT). That is, we propose an automatic adaptive questioning method that aims to expose the reasoning behavior of a VQA algorithm. Specifically, we introduce a reinforcement learning (RL) agent that observes the history of previously asked questions and uses it to select the next question to pose. We demonstrate our approach in the context of evaluating algorithms that automatically answer questions related to diabetic macular edema (DME) grading. The experiments show that such an agent behaves similarly to a clinician, asking questions that are relevant to key clinical concepts.
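The adaptive questioning idea in the description, an agent that looks at the history of questions asked so far and picks the next one, can be illustrated with a small toy sketch. This is not the authors' implementation: tabular epsilon-greedy Q-learning is swapped in purely for illustration, and the question pool, the `HistoryAgent` class, the `run_episode` helper, and the reward function are all hypothetical names and assumptions that do not come from the paper.

```python
import random
from collections import defaultdict

# Hypothetical question pool, for illustration only (not from the paper).
QUESTIONS = [
    "Is there a hard exudate in the image?",
    "Is there a hard exudate in the macula?",
    "What is the DME grade?",
]


class HistoryAgent:
    """Toy tabular epsilon-greedy Q-learning agent.

    The state is the tuple of (question index, answer) pairs seen so far;
    the action is the index of the next question to ask.
    """

    def __init__(self, n_questions, epsilon=0.3, alpha=0.5, gamma=0.9, seed=0):
        self.n = n_questions
        self.epsilon, self.alpha, self.gamma = epsilon, alpha, gamma
        self.q = defaultdict(float)  # (history, action) -> estimated value
        self.rng = random.Random(seed)

    def _remaining(self, history):
        # Questions that have not been asked yet in this episode.
        asked = {q for q, _ in history}
        return [a for a in range(self.n) if a not in asked]

    def select(self, history):
        """Choose the next question given the history (explore or exploit)."""
        candidates = self._remaining(history)
        if self.rng.random() < self.epsilon:
            return self.rng.choice(candidates)
        return max(candidates, key=lambda a: self.q[(history, a)])

    def update(self, history, action, reward, next_history):
        """One-step Q-learning update for the (history, action) pair."""
        best_next = max(
            (self.q[(next_history, a)] for a in self._remaining(next_history)),
            default=0.0,
        )
        key = (history, action)
        self.q[key] += self.alpha * (reward + self.gamma * best_next - self.q[key])


def run_episode(agent, answer_fn, reward_fn, n_steps):
    """Ask up to n_steps distinct questions; return the resulting history."""
    history = ()
    for _ in range(min(n_steps, agent.n)):
        action = agent.select(history)
        answer = answer_fn(action)  # stand-in for querying a VQA model
        next_history = history + ((action, answer),)
        reward = reward_fn(history, action, answer)
        agent.update(history, action, reward, next_history)
        history = next_history
    return history
```

In this sketch, `answer_fn` stands in for the VQA model under evaluation and `reward_fn` encodes whatever questioning behavior one wants to encourage; both are placeholders, and a realistic setup would need a far richer state representation than a lookup table over exact histories.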
Date of Publication
2023-07
Publication Type
Article
Subject(s)
600 Technology > 610 Medicine & health
Keyword(s)
Interpretability; Reinforcement learning; Retinal image analysis; VQA; Visual Turing test; Visual question answering validation
Language(s)
en
Contributor(s)
Fountoukidou, Tatiana
ARTORG Center for Biomedical Engineering Research - AI in Medical Imaging Laboratory
Sznitman, Raphael
ARTORG Center for Biomedical Engineering Research - AI in Medical Imaging Laboratory
Additional Credits
ARTORG Center for Biomedical Engineering Research - AI in Medical Imaging Laboratory
Series
Medical image analysis
Publisher
Elsevier
ISSN
1361-8415
Access(Rights)
restricted