Technical University of Munich

Enhancing Multimodal Compositional Reasoning of Visual Language Models with Generative Negative Mining (bibtex)

by U Sahin, H Li, Q Khan, D Cremers and T Volker

Reference:

Enhancing Multimodal Compositional Reasoning of Visual Language Models with Generative Negative Mining (U Sahin, H Li, Q Khan, D Cremers and T Volker), In IEEE Winter Conference on Applications of Computer Vision (WACV, 2024. ([arXiv][project page][code])

Bibtex Entry:

@inproceedings{compreason2024,
 title = {Enhancing Multimodal Compositional Reasoning of Visual Language Models with Generative Negative Mining},
 booktitle = {IEEE Winter Conference on Applications of Computer Vision (WACV},
 author = {U Sahin and H Li and Q Khan and D Cremers and T Volker},
 year = {2024},
 keywords = {neural networks, deep learning, Large Language Models},
}

Powered by bibtexbrowser

Go Back

Computer Vision Group
TUM School of Computation, Information and Technology
Technical University of Munich

Technical University of Munich

Links

Informatik IX
Computer Vision Group

News

GCPR / VMV 2024

Navigation

Rechte Seite

Informatik IX
Computer Vision Group

News

GCPR / VMV 2024

Computer Vision GroupTUM School of Computation, Information and TechnologyTechnical University of Munich

Technical University of Munich

Links

Informatik IX Computer Vision Group

News

GCPR / VMV 2024

Navigation

Rechte Seite

Informatik IX Computer Vision Group

News

GCPR / VMV 2024

Computer Vision Group
TUM School of Computation, Information and Technology
Technical University of Munich

Informatik IX
Computer Vision Group

Informatik IX
Computer Vision Group