Soft Bliss Academy

Improve Vision Language Model Chain-of-thought Reasoning

by softbliss
June 9, 2025
in Machine Learning


Chain-of-thought (CoT) reasoning in vision language
models (VLMs) is crucial for improving
interpretability and trustworthiness. However,
current training recipes often rely on
datasets dominated by short annotations with
minimal rationales. In this work, we show that
training VLMs on short answers leads to poor
generalization on reasoning tasks that require
more detailed explanations. To address this limitation,
we propose a two-stage post-training
strategy that extends the use of short-answer
data to enhance CoT reasoning. First, we
augment short answers with CoT reasoning
generated by GPT-4o and fine-tune the VLM on
the augmented data to strengthen its CoT
capabilities. Second, we use short answers as
outcome rewards for reinforcement learning.
Specifically, short answers serve as correctness
indicators for constructing positive (correct) and
negative (incorrect) pairs from model-generated
reasoning chains. These pairs are then used to
calibrate the model’s reasoning via Direct
Preference Optimization (DPO).
Our experiments show significant
improvements in CoT reasoning on benchmark
datasets, along with enhanced generalization to
direct answer prediction. This work provides
a critical data resource for VLM CoT training
and demonstrates the effectiveness of outcome
rewards for post-training multimodal models.
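
The second stage described above can be illustrated with a minimal Python sketch. It is not the authors' implementation: the "Answer:" line format, the exact-match check, and the function names `extract_answer`, `build_preference_pairs`, and `dpo_loss` are assumptions made for illustration. The loss follows the standard DPO objective for a single pair, taking summed sequence log-probabilities from the trained policy and a frozen reference model as plain floats.

```python
import math
import re
from itertools import product


def extract_answer(chain: str) -> str:
    """Return the normalized text after the last 'Answer:' marker.

    The 'Answer:' convention is a hypothetical output format.
    """
    matches = re.findall(r"answer:\s*(.+)", chain, flags=re.IGNORECASE)
    return matches[-1].strip().lower().rstrip(".") if matches else ""


def build_preference_pairs(chains, short_answer, max_pairs=4):
    """Use the short answer as an outcome reward to form (chosen, rejected) pairs."""
    gold = short_answer.strip().lower().rstrip(".")
    correct = [c for c in chains if extract_answer(c) == gold]
    incorrect = [c for c in chains if extract_answer(c) != gold]
    # Every correct chain is paired with every incorrect chain, then truncated.
    pairs = [{"chosen": c, "rejected": r} for c, r in product(correct, incorrect)]
    return pairs[:max_pairs]


def dpo_loss(logp_chosen, logp_rejected, ref_logp_chosen, ref_logp_rejected, beta=0.1):
    """Standard DPO loss for one pair: -log(sigmoid(beta * margin))."""
    margin = beta * (
        (logp_chosen - ref_logp_chosen) - (logp_rejected - ref_logp_rejected)
    )
    return math.log1p(math.exp(-margin))


# Usage: three sampled reasoning chains for one question whose short answer is "5".
sampled_chains = [
    "The chart shows 3 red bars and 2 blue bars, so 3 + 2 = 5.\nAnswer: 5",
    "I count 4 bars in total.\nAnswer: 4",
    "Adding red and blue bars gives 5.\nAnswer: 5.",
]
pairs = build_preference_pairs(sampled_chains, "5")
```

Questions for which every sampled chain is correct, or every chain is incorrect, yield no pairs under this scheme and so contribute no preference signal. In practice a library trainer would typically replace the hand-written loss.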

  • † Work done while at Apple
  • ‡ Carnegie Mellon University
Tags: Chain-of-Thought, Improve, Language, Model, Reasoning, Vision
© 2025 https://softblissacademy.online/- All Rights Reserved
