Paper 14085-26

Stress-testing medical vision models with images edited by controllable inversion algorithms

15 April 2026 • 13:50 - 14:10 CEST | Luxembourg/Salon 2 (Niveau/Level 0)

Abstract

Biomedical visual models aim to facilitate the diagnosis, detection, or segmentation of X-ray images. Recently, deep learning has contributed to this task. However, these models are trained on a collection of images that carry biases due to the contexts in which they are usually gathered. During the stress-test process, it is possible to quantitatively measure the performance of these models on instances outside the distribution they were trained on. To this end, it is proposed to edit biomedical images, particularly chest X-rays. Synthetic editing saves the effort of manually collecting medical images or requesting databases that may raise ethical concerns about their origin. The editing is carried out using latent probabilistic diffusion models. The algorithm for inverting diffused images during inference is modified to provide information from the image to be edited and text as condition. In order to avoid entire edition by covering only desired areas, we proposed to contrast conditional and unconditional diffusion outputs as well as iteratively minimizing absolute distance between predicted target and denoised image input. Results show modifications are meet, by providing only extra text specification with no need for masked areas. Finally we measure limitations over ResNet biomedical classification model, allowing to quantify performance with synthetic easy-to-obtain data.

Presenter

Michel Pezzat

Center of Advanced Computation Studies (Mexico), Laboratory of Advance Image Processing (Mexico), National Autonomous University of Mexico (Mexico)

Graduated from the National Polithecnic Institute in 2013, with Electronic and Communication Degree. By 2017, I started my academic career in Signal Processing during a posgrade to obtain Master in Science Degree in Microelectronics. I started research in the machine learning by developing audio classification and synthesis models. I obtained my Doctorade Degree by 2024, publishing paper in the generative models a year before. This same generative models I started to extrapolate them in the biomedical image processing fields. Which allowed me to start a posdoctorade next year. Such program is still on-going.

Application tracks: AI/ML

Presenter/Author

Michel Pezzat

Center of Advanced Computation Studies (Mexico), Laboratory of Advance Image Processing (Mexico), National Autonomous University of Mexico (Mexico)

Author

Jimena Olveres Montiel

Univ. Nacional Autónoma de México (Mexico)

Author

Boris Escalante-Ramírez

Univ. Nacional Autónoma de México (Mexico)

Author

Rodrigo Ramoz Dias

Univ. Nacional Autónoma de México (Mexico)

Author

Carlos Minutti

Ctr. de Investigación e Innovación en Tecnologías de la Información y Comunicación (Mexico), Univ. Nacional Autónoma de México (Mexico)