• Latest
  • Trending
  • All
  • Market Updates
  • Cryptocurrency
  • Blockchain
  • Investing
  • Commodities
  • Personal Finance
  • Technology
  • Business
  • Real Estate
  • Finance
Now it’s TikTok parent ByteDance’s turn for a reasoning AI: enter Seed-Thinking-v1.5!

Now it’s TikTok parent ByteDance’s turn for a reasoning AI: enter Seed-Thinking-v1.5!

April 11, 2025
Stocks making the biggest moves after hours: FIVE, MDB, VRNT

Stocks making the biggest moves after hours: FIVE, MDB, VRNT

June 5, 2025
A Major Warning Signal for Investors

A Major Warning Signal for Investors

June 5, 2025
Stop guessing why your LLMs break: Anthropic’s new tool shows you exactly what goes wrong

Stop guessing why your LLMs break: Anthropic’s new tool shows you exactly what goes wrong

June 5, 2025
Special limited offer for 2 indicators / SULEIMAN LEVELS v 7.7 – Analytics & Forecasts – 4 June 2025

Special limited offer for 2 indicators / SULEIMAN LEVELS v 7.7 – Analytics & Forecasts – 4 June 2025

June 5, 2025
It’s always steel — tariffs provide Trump with a familiar trade weapon

It’s always steel — tariffs provide Trump with a familiar trade weapon

June 4, 2025
Elon Musk sells Twitter to xAI

Musk’s opposition is ‘one disagreement’ in an otherwise harmonious relationship

June 4, 2025
Chime’s IPO may struggle to strike a chord with investors

Chime’s IPO may struggle to strike a chord with investors

June 4, 2025
RBC says the USD ‘remains extremely overvalued’, ‘much more weakness still lies ahead’

RBC says the USD ‘remains extremely overvalued’, ‘much more weakness still lies ahead’

June 4, 2025
Bitcoin Price Crash Below $100,000 Still Possible: Analysts Issue Downtrend Warnings

Bitcoin Price Crash Below $100,000 Still Possible: Analysts Issue Downtrend Warnings

June 4, 2025
How To Get Your Life Together: 10 Step Checklist

How To Get Your Life Together: 10 Step Checklist

June 4, 2025
How Hard It Is to Make Trade Deals

How Hard It Is to Make Trade Deals

June 4, 2025
Trump’s commerce secretary hints at Bitcoin-only strategic reserve

Semler Scientific Buys $20M More Bitcoin

June 4, 2025
Thursday, June 5, 2025
No Result
View All Result
InvestorNewsToday.com
  • Home
  • Market
  • Business
  • Finance
  • Investing
  • Real Estate
  • Commodities
  • Crypto
  • Blockchain
  • Personal Finance
  • Tech
InvestorNewsToday.com
No Result
View All Result
Home Technology

Now it’s TikTok parent ByteDance’s turn for a reasoning AI: enter Seed-Thinking-v1.5!

by Investor News Today
April 11, 2025
in Technology
0
Now it’s TikTok parent ByteDance’s turn for a reasoning AI: enter Seed-Thinking-v1.5!
491
SHARES
1.4k
VIEWS
Share on FacebookShare on Twitter

Be a part of our day by day and weekly newsletters for the newest updates and unique content material on industry-leading AI protection. Be taught Extra


It began with the announcement of OpenAI’s o1 mannequin in September 2024, however actually took off with DeepSeek R1 launched in January 2025.

Now, plainly most main AI mannequin suppliers and trainers are in a brand new race to ship higher, quicker, cheaper, extra reasonably priced or extra highly effective and performant “reasoning” AI language fashions — that’s, ones that possibly take slightly longer to answer a human person, however ideally achieve this with higher, extra complete, extra nicely “reasoned” solutions, which these class of fashions get by performing “chain-of-thought,” reflecting on their very own conclusions and interrogating them for veracity earlier than responding.

ByteDance, the Chinese language net media big dad or mum of TikTok, is the newest to hitch the social gathering with announcement and publication of the technical paper behind Seed-Considering-v1.5, an upcoming giant language mannequin (LLM) designed to advance reasoning efficiency throughout each science, tech, math, and engineering (STEM) fields and general-purpose domains.

The mannequin shouldn’t be but out there for obtain or use, and it’s unclear what the licensing phrases shall be — whether or not it is going to be proprietary/closed supply or open supply/free for all to make use of and modify at will, or someplace in between. However the technical paper gives some noteworthy particulars which are price going over now prematurely of each time it’s made out there.

Constructed atop the more and more in style Combination-of-Specialists (MoE) structure

Like Meta’s new Llama 4 and Mistral’s Mixtral earlier than it, Seed-Considering-v1.5 is constructed utilizing a Combination-of-Specialists (MoE) structure.

This structure is designed to make fashions extra environment friendly, primarily combining the capabilities of a number of fashions into one, every mannequin specializing in a special area.

On this case, the MoE structure signifies that Seed-Considering-v1.5 makes use of solely 20 billion parameters at a time from a complete of 200 billion.

ByteDance says in its technical paper printed to GitHub that Seed-Considering-v1.5 prioritizes structured reasoning and considerate response era.

The outcomes almost converse for themselves, with Seed-Considering-v1.5 outperforming DeepSeek R1 and approaching Google’s newly launched Gemini 2.5 Professional and OpenAI’s o3-mini-high reasoner on many third-party benchmark evaluations, even exceeding these two within the case of the ARC-AGI benchmark, which measures progress in direction of synthetic normal intelligence, seen because the aim or “Holy Grail” of AI — a mannequin that outperforms people on most economically worthwhile duties, in accordance with OpenAI’s definition.

Positioned as a compact but succesful different to bigger state-of-the-art fashions, Seed-Considering-v1.5 achieves aggressive benchmark outcomes and introduces improvements in reinforcement studying (RL), coaching knowledge curation, and AI infrastructure.

Efficiency benchmarks and mannequin focus

Seed-Considering-v1.5 reveals sturdy efficiency on a collection of difficult duties, scoring 86.7% on AIME 2024, 55.0% move@8 on Codeforces, and 77.3% on the GPQA science benchmark. These outcomes place it near or matching fashions like OpenAI’s o3-mini-high and Google’s Gemini 2.5 Professional on particular reasoning metrics.

On non-reasoning duties, the mannequin was evaluated by means of human desire comparisons and achieved an 8.0% larger win charge over DeepSeek R1, suggesting that its strengths generalize past simply logic or math-heavy challenges.

To handle saturation in frequent benchmarks like AIME, ByteDance launched BeyondAIME, a brand new, tougher math benchmark with curated issues designed to withstand memorization and higher discriminate mannequin efficiency. This and the Codeforces analysis set are anticipated to be publicly launched to help future analysis.

Information technique

Coaching knowledge performed a central position within the mannequin’s improvement. For supervised fine-tuning (SFT), the group curated 400,000 samples, together with 300,000 verifiable (STEM, logic, and coding duties) and 100,000 non-verifiable issues like artistic writing and role-playing.

For RL coaching, knowledge was segmented into:

  • Verifiable issues: 100,000 rigorously filtered STEM questions and logic puzzles with identified solutions, sourced from elite competitions and knowledgeable evaluate.
  • Non-verifiable duties: Human-preference datasets targeted on open-ended prompts, evaluated utilizing pairwise reward fashions.

The STEM knowledge leaned closely on superior arithmetic, accounting for over 80% of the issue set. Extra logic knowledge included duties like Sudoku and 24-point puzzles, with adjustable issue to match mannequin progress.

Reinforcement studying strategy

Reinforcement studying in Seed-Considering-v1.5 is powered by customized actor-critic (VAPO) and policy-gradient (DAPO) frameworks, developed to handle identified instabilities in RL coaching. These strategies concentrate on decreasing reward sign sparsity and enhancing coaching stability, particularly in lengthy chain-of-thought (CoT) settings.

Reward fashions play a vital position in supervising RL outputs. ByteDance launched two key instruments:

  • Seed-Verifier: A rule-based LLM that checks if generated and reference solutions are mathematically equal.
  • Seed-Considering-Verifier: A step-by-step reasoning-based choose that improves judgment consistency and resists reward hacking.

This two-tiered reward system allows nuanced analysis for each simple and complicated duties.

Infrastructure and scaling

To help environment friendly large-scale coaching, ByteDance constructed a system atop its HybridFlow framework, with execution dealt with by Ray clusters and co-located coaching and inference processes to scale back GPU idle time.

A notable innovation is the Streaming Rollout System (SRS), which separates mannequin evolution from runtime execution. It accelerates iteration velocity by asynchronously managing partially accomplished generations throughout mannequin variations. This structure reportedly delivers as much as 3× quicker RL cycles.

Extra infrastructure strategies embrace:

  • Combined precision (FP8) for reminiscence financial savings
  • Professional parallelism and kernel auto-tuning for MoE effectivity
  • ByteCheckpoint for resilient and versatile checkpointing
  • AutoTuner for optimizing parallelism and reminiscence configurations

Human analysis and real-world impression

To guage alignment with human-centric preferences, ByteDance performed human testing throughout a spread of domains together with artistic writing, humanities information, and normal dialog.

Seed-Considering-v1.5 persistently outperformed DeepSeek R1 throughout periods, reinforcing its applicability to real-world person wants.

The event group notes that reasoning fashions skilled totally on verifiable duties demonstrated sturdy generalization to artistic domains—an end result attributed to the construction and rigor embedded in mathematical coaching workflows.

What it means for technical leaders, knowledge engineers and enterprise decision-makers

For technical leads managing the lifecycle of huge language fashions—from knowledge curation to deployment—Seed-Considering-v1.5 presents a chance to rethink how reasoning capabilities are built-in into enterprise AI stacks.

Its modular coaching course of, which incorporates verifiable reasoning datasets and multi-phase reinforcement studying, is especially interesting to groups seeking to scale LLM improvement whereas retaining fine-grained management.

ByteDance’s strikes to introduce Seed-Verifier and Seed-Considering-Verifier provide mechanisms for extra reliable reward modeling, which might be vital when deploying fashions into customer-facing or regulated environments.

For groups that usually function below tight deadlines and restricted bandwidth, the mannequin’s stability below reinforcement studying—enabled by improvements like VAPO and dynamic sampling—may cut back iteration cycles and streamline fine-tuning for particular duties.

From an orchestration and deployment perspective, the mannequin’s hybrid infrastructure strategy—together with the Streaming Rollout System (SRS) and help for FP8 optimization—suggests important beneficial properties in coaching throughput and {hardware} utilization.

These options could be worthwhile for engineers accountable for scaling LLM operations throughout cloud and on-prem techniques. The truth that Seed-Considering-v1.5 was skilled with mechanisms to adapt reward suggestions primarily based on runtime dynamics speaks on to the challenges of managing heterogeneous knowledge pipelines and sustaining consistency throughout domains.

For groups tasked with making certain reliability, reproducibility, and steady integration of latest instruments, Seed-Considering-v1.5’s system-level design may function a blueprint for constructing sturdy, multi-modal orchestration techniques.

For knowledge engineering professionals, the structured strategy to coaching knowledge—together with rigorous filtering, augmentation, and knowledgeable verification—reinforces the significance of information high quality as a multiplier of mannequin efficiency. This might encourage extra deliberate approaches to dataset improvement and validation pipelines.

Future outlook

Seed-Considering-v1.5 is the results of collaboration inside ByteDance’s Seed LLM Programs group, led by Yonghui Wu and with public illustration by Haibin Lin, a long-time AI contributor.

The venture additionally attracts on earlier efforts like Doubao 1.5 Professional and incorporates shared strategies in RLHF and knowledge curation.

Wanting forward, the group plans to proceed refining reinforcement studying strategies, with a concentrate on coaching effectivity and reward modeling for non-verifiable duties. The general public launch of inside benchmarks corresponding to BeyondAIME is meant to foster broader development in reasoning-focused AI analysis.

Every day insights on enterprise use instances with VB Every day

If you wish to impress your boss, VB Every day has you lined. We provide the inside scoop on what firms are doing with generative AI, from regulatory shifts to sensible deployments, so you may share insights for max ROI.

Learn our Privateness Coverage

Thanks for subscribing. Take a look at extra VB newsletters right here.

An error occured.



Source link
Tags: ByteDancesenterparentreasoningSeedThinkingv1.5TikTokTurn
Share196Tweet123
Previous Post

Indicator VeMAs XAUUSD this week. – My Trading – 11 April 2025

Next Post

Scotland’s Lomond School accepts Bitcoin for tuition payments, a first in the UK

Investor News Today

Investor News Today

Next Post
Scotland’s Lomond School accepts Bitcoin for tuition payments, a first in the UK

Scotland's Lomond School accepts Bitcoin for tuition payments, a first in the UK

  • Trending
  • Comments
  • Latest
Equinor scales back renewables push 7 years after ditching ‘oil’ from its name

Equinor scales back renewables push 7 years after ditching ‘oil’ from its name

February 5, 2025
Best High-Yield Savings Accounts & Rates for January 2025

Best High-Yield Savings Accounts & Rates for January 2025

January 3, 2025
Suleiman Levels limited V 3.00 Update and Offer – Analytics & Forecasts – 5 January 2025

Suleiman Levels limited V 3.00 Update and Offer – Analytics & Forecasts – 5 January 2025

January 5, 2025
10 Best Ways To Get Free $10 in PayPal Money Instantly

10 Best Ways To Get Free $10 in PayPal Money Instantly

December 8, 2024
Why America’s economy is soaring ahead of its rivals

Why America’s economy is soaring ahead of its rivals

0
Dollar climbs after Donald Trump’s Brics tariff threat and French political woes

Dollar climbs after Donald Trump’s Brics tariff threat and French political woes

0
Nato chief Mark Rutte’s warning to Trump

Nato chief Mark Rutte’s warning to Trump

0
Top Federal Reserve official warns progress on taming US inflation ‘may be stalling’

Top Federal Reserve official warns progress on taming US inflation ‘may be stalling’

0
Stocks making the biggest moves after hours: FIVE, MDB, VRNT

Stocks making the biggest moves after hours: FIVE, MDB, VRNT

June 5, 2025
A Major Warning Signal for Investors

A Major Warning Signal for Investors

June 5, 2025
Stop guessing why your LLMs break: Anthropic’s new tool shows you exactly what goes wrong

Stop guessing why your LLMs break: Anthropic’s new tool shows you exactly what goes wrong

June 5, 2025
Special limited offer for 2 indicators / SULEIMAN LEVELS v 7.7 – Analytics & Forecasts – 4 June 2025

Special limited offer for 2 indicators / SULEIMAN LEVELS v 7.7 – Analytics & Forecasts – 4 June 2025

June 5, 2025

Live Prices

© 2024 Investor News Today

No Result
View All Result
  • Home
  • Market
  • Business
  • Finance
  • Investing
  • Real Estate
  • Commodities
  • Crypto
  • Blockchain
  • Personal Finance
  • Tech

© 2024 Investor News Today