Computer Vision and Multimodal Computing Reading Group

We read and discuss papers in the areas of (1) diffusion models, (2) digital avatars, (3) multimodal perception and robotics, (4) computer vision for social interaction, (5) implicit neural representations, and (6) other areas related to our research. If you would like to join the group, please send an email to: yapeng dot tian at utdallas dot edu.

Regular Meeting Time & Place

Scheduled Meetings




Shijian Deng: PaLM-E: An Embodied Multimodal Language Model


Yulang Wu: What are Diffusion Models?


Siva Sai Nagender Vasireddy: DiffusionDet: Diffusion Model for Object Detection


Harsh Singh: Taming Diffusion Models for Audio-Driven Co-Speech Gesture Generation


Shijian Deng: High-resolution image reconstruction with latent diffusion models from human brain activity

Current Participants

Prof. Yapeng Tian
Siva Sai Nagender Vasireddy, PhD Student
Shijian Deng, PhD Student
Harsh Singh, PhD Student
Yulang Wu, Graduate Student
Prathyushaa Vajravelu Karthikeyan, Graduate Student
Sasha Kaplan, Undergraduate Student
Zeke Barnett, K-12 Student

This website was inspired by the Topology Data Analysis Reading Group.