Seed-Thinking-v1.5: A New Leap in Reasoning AI with Reinforcement Learning
- Apal Tech Editorial
- Apr 17
- 1 min read
ByteDance Seed has officially introduced Seed-Thinking-v1.5 – a groundbreaking reasoning AI model powered by large-scale Reinforcement Learning (RL). This marks a significant milestone in advancing AI reasoning closer to human-like thinking.

🌟 Key Highlights:
🤖 Impressive Reasoning Capabilities:Seed-Thinking-v1.5 excels not only in logic-based problems but also performs effectively in real-world scenarios—an area where many previous models have struggled.
📊 Outstanding Benchmark Performance:
86.7% on AIME 2024 – matching OpenAI’s o3-mini-high
Outperforms competitors like o1 and DeepSeek R1
77.3% on GPQA for scientific questions
Demonstrates strong coding abilities on Codeforces
+8% positive user feedback in non-reasoning tasks (compared to DeepSeek R1)
🔍 Three Drivers Behind the Breakthrough:
Rich and carefully curated training data
Two new RL algorithms – VAPO and DAPO→ Enhance training stability
Modern training infrastructure→ Up to 3x faster than previous RL-based models
📌 Looking Ahead:While challenges like the BeyondAIME test remain, the research team at
ByteDance Seed is committed to ongoing improvements, pushing the boundaries of reasoning AI even further.
This isn’t just a technical advancement—it’s an open invitation for the community to explore the boundless potential of AI, from academia to real-world applications.
👉 Stay tuned to see where the Seed-Thinking journey will take us next!
Comments