Yahoo Malaysia Web Search

Search results

  1. 1 Jul 2024 · Meerkat: Audio-Visual Large Language Model for Grounding in Space and Time. Sanjoy Chowdhury, Sayan Nag, Subhrajyoti Dasgupta, Jun Chen, Mohamed Elhoseiny, Ruohan Gao, Dinesh Manocha. Leveraging Large Language Models' remarkable proficiency in text-based tasks, recent works on Multi-modal LLMs (MLLMs) extend them to other modalities like vision ...

  2. 7 Jul 2024 · Check out Sanjoy Chowdhury's movies list, family details, net worth, age, height, filmography, biography, upcoming movies, photos, awards, songs, videos and Latest News only on Filmibeat.

  3. 1 Jul 2024 · Meerkat: Audio-Visual Large Language Model for Grounding in Space and Time. Sanjoy Chowdhury, Sayan Nag, +4 authors. Dinesh Manocha. Published 1 July 2024. Computer Science. TLDR. This work presents Meerkat, an audio-visual LLM equipped with a fine-grained understanding of image and audio both spatially and temporally, and introduces ...

  4. 17 Jun 2024 · MeLFusion is a text-to-music diffusion model with a novel "visual synapse", which effectively infuses the semantics from the visual modality into the generated music. To facilitate research in this area, we introduce a new dataset MeLBench, and propose a new evaluation metric IMSM.

  5. 19 Jun 2024 · Vishnu Sashank Dorbala, Sanjoy Chowdhury, Dinesh Manocha Abstract We present a novel approach to automatically synthesize “wayfinding instructions” for an embodied robot agent.

  6. dblp.org › db › confdblp: ICCP 2023

    4 Jul 2024 · Jiaye Wu, Sanjoy Chowdhury, Hariharmano Shanmugaraja, David Jacobs, Soumyadip Sengupta: Measured Albedo in the Wild: Filling the Gap in Intrinsics Evaluation.

  7. 5 Jul 2024 · Sanjoy Paul is an associate professor at the University of Technology Sydney's Business School and Priyabrata Chowdhury is a senior lecturer at RMIT University's College of Business and Law.