Yahoo Malaysia Web Search

Search results

  1. Jul 1, 2024 · Meerkat: Audio-Visual Large Language Model for Grounding in Space and Time. Sanjoy Chowdhury, Sayan Nag, Subhrajyoti Dasgupta, Jun Chen, Mohamed Elhoseiny, Ruohan Gao, Dinesh Manocha. Leveraging Large Language Models' remarkable proficiency in text-based tasks, recent works on Multi-modal LLMs (MLLMs) extend them to other modalities like vision ...

  2. 2 days ago · Check out Sanjoy Chowdhury's movies list, family details, net worth, age, height, filmography, biography, upcoming movies, photos, awards, songs, videos and Latest News only on Filmibeat.

  3. Jul 1, 2024 · Meerkat: Audio-Visual Large Language Model for Grounding in Space and Time. Sanjoy Chowdhury, Sayan Nag, +4 authors. Dinesh Manocha. Published 1 July 2024. Computer Science. TLDR. This work presents Meerkat, an audio-visual LLM equipped with a fine-grained understanding of image and audio both spatially and temporally, and introduces ...

  4. Jun 17, 2024 · MeLFusion is a text-to-music diffusion model with a novel "visual synapse", which effectively infuses the semantics from the visual modality into the generated music. To facilitate research in this area, we introduce a new dataset MeLBench, and propose a new evaluation metric IMSM.

  5. Jun 19, 2024 · Vishnu Sashank Dorbala, Sanjoy Chowdhury, Dinesh Manocha Abstract We present a novel approach to automatically synthesize “wayfinding instructions” for an embodied robot agent.

  6. dblp.org › db › confdblp: ICCP 2023

    5 days ago · Jiaye Wu, Sanjoy Chowdhury, Hariharmano Shanmugaraja, David Jacobs, Soumyadip Sengupta: Measured Albedo in the Wild: Filling the Gap in Intrinsics Evaluation.

  7. 4 days ago · Sanjoy Paul is an associate professor at the University of Technology Sydney's Business School and Priyabrata Chowdhury is a senior lecturer at RMIT University's College of Business and Law.