From Perception to Action: Multimodal Embodied AI Seminar

Seminar: Prof. Winston Hsu (National Taiwan University)
Monday, June 16, 11:30 a.m., Room 120, Centro Didattico Morgagni, V.le Morgagni 40-44, Firenze

Abstract: In this talk, we will showcase a series of research works bridging “perception” and “action,” demonstrating advanced scenarios supporting robotics and automotive applications while incorporating human-in-the-loop methodologies. We will envision future possibilities for multimodal embodied AI and emphasize the importance of efficiency and sustainability in cloud-edge hybrid inference systems.

Bio: Dr. Winston Hsu is a Professor at National Taiwan University. Previously (2021-2024), he served as the CTO and Vice President at MobileDrive, a joint venture between Foxconn and Stellantis focused on advancing digital cockpit and ADAS in HPC-based solutions. He earned his Ph.D. from Columbia University, New York, in 2007. Dr. Hsu is an active researcher in computer vision and machine intelligence, providing AI strategy consulting to several leading Taiwan-based ICT companies. He is the Founding Director of the NVIDIA AI Lab at NTU, the first NVIDIA AI Lab in Asia. Dr. Hsu was a visiting scientist at Microsoft Research Redmond (2014) and the IBM T.J. Watson Research Center (2016). Additionally, he co-founded thingnario, an AI/IoT startup specializing in renewable energy solutions. Earlier in his career, he was a founding engineer and R&D manager at CyberLink Corp.
