Tuesday, June 16, 2026
mGrowTech

Sakana AI Commercializes AB-MCTS in Sakana Marlin, an Enterprise Agent Generating Up to 100-Page Research Reports With Slides

by Josh
June 16, 2026
in AI, Analytics and Automation


Tokyo-based Sakana AI shipped its first commercial product, Sakana Marlin, this week. The Sakana team positions it as a ‘Virtual CSO’ (Chief Strategy Officer). It is a B2B autonomous research agent built for enterprises.

Marlin does not answer in seconds like a chatbot. You give it one research topic. It then runs autonomously for up to about eight hours. Each run returns a long report plus a presentation slide deck. Sakana says a single session issues hundreds to thousands of LLM queries.

What is Sakana Marlin

Marlin is an enterprise research agent, not a chat assistant. You give it one topic or question. It then plans hypotheses, browses sources, and verifies findings on its own. It compresses weeks of strategy work into hours.

The deliverable is structured for decision-makers. The Japanese announcement describes reports of dozens of pages. The English announcement cites reports of up to roughly 100 pages. At a press hands-on, reports ran 60–100 pages and cited 60–80 sources. Each report includes a main body, references, and appendices. Presentation slides are generated using image-generation AI.

The Sakana team refined Marlin through a closed beta in April 2026. Around 300 professionals tested it on real tasks during that beta. Those tasks spanned strategy formulation, market research, risk analysis, and competitive analysis. Sakana has also partnered with MUFG and taken strategic investment from Citigroup.

Inside AB-MCTS: Wider or Deeper

The backbone of Marlin is AB-MCTS, or Adaptive Branching Monte Carlo Tree Search. It comes from Sakana’s earlier paper, “Wider or Deeper? Scaling LLM Inference-Time Compute with Adaptive Branching Tree Search.”

AB-MCTS treats reasoning as a tree-search problem. At each step the algorithm makes one decision. It can go wider by generating a new candidate answer. Or it can go deeper by refining a promising existing answer. Standard repeated sampling only goes wider in parallel, then hopes one answer is right.
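The wider-or-deeper choice can be sketched as a toy in plain Python. This is not Sakana’s implementation: the Beta-posterior Thompson sampling below is a simplified stand-in for the paper’s method, the scoring is made up, and `Node`, `toy_generate`, `step`, and `run_search` are hypothetical names chosen for illustration.

```python
import random
from dataclasses import dataclass, field


@dataclass
class Node:
    score: float                      # quality of this candidate, in [0, 1]
    depth: int                        # 0 = root placeholder (no answer yet)
    children: list = field(default_factory=list)


def subtree_scores(node):
    """Collect the scores of a node and all of its descendants."""
    scores = [node.score]
    for child in node.children:
        scores.extend(subtree_scores(child))
    return scores


def thompson_draw(scores, rng):
    """Sample from a Beta posterior that treats each score as a soft success."""
    alpha = 1.0 + sum(scores)
    beta = 1.0 + sum(1.0 - s for s in scores)
    return rng.betavariate(alpha, beta)


def toy_generate(parent, rng):
    """Made-up scorer: fresh candidates are random, refinements drift from the parent."""
    if parent.depth == 0:
        return rng.random()
    return min(1.0, max(0.0, parent.score + rng.uniform(-0.1, 0.2)))


def step(root, rng):
    """Add exactly one node to the tree and report whether it went wider or deeper."""
    node = root
    while True:
        # One draw for "generate a new child here", one draw per existing child.
        gen_draw = thompson_draw([c.score for c in node.children], rng)
        child_draws = [thompson_draw(subtree_scores(c), rng) for c in node.children]
        if not node.children or gen_draw >= max(child_draws):
            node.children.append(Node(toy_generate(node, rng), node.depth + 1))
            # A new child of the root is a fresh candidate; anything else refines one.
            return "wider" if node is root else "deeper"
        node = node.children[child_draws.index(max(child_draws))]


def run_search(budget=24, seed=0):
    rng = random.Random(seed)
    root = Node(score=0.0, depth=0)
    counts = {"wider": 0, "deeper": 0}
    for _ in range(budget):
        counts[step(root, rng)] += 1
    best = max(subtree_scores(root)[1:])  # skip the root placeholder
    return root, counts, best


root, counts, best = run_search()
print(counts, round(best, 3))
```

Each step spends one unit of budget on exactly one new node, so the wider and deeper counts always sum to the budget. Repeated sampling corresponds to the special case where every step goes wider.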

A multi-LLM variant adds a second choice: which model handles each step. It can route a step to a different model entirely. In Sakana’s reported ARC-AGI-2 experiments, this collaboration helped. Combining o4-mini, Gemini 2.5 Pro, and DeepSeek-R1 solved about 27.5% of tasks. The o4-mini model alone solved about 23%. Marlin applies the same adaptive search to long-horizon research.
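One common way to make a routing choice like this is a Thompson-sampling bandit over the available models. The sketch below illustrates that idea only; the article does not say how Marlin routes steps, `route_steps` is a hypothetical helper, and the success rates are placeholders invented for the demo, not benchmark results for any real model.

```python
import random

# Placeholder success rates, invented for this demo only; NOT benchmark results.
SIMULATED_QUALITY = {"model_a": 0.3, "model_b": 0.5, "model_c": 0.4}


def route_steps(n_steps=200, seed=0):
    rng = random.Random(seed)
    # Beta(1, 1) prior per model, stored as [successes + 1, failures + 1].
    posterior = {name: [1, 1] for name in SIMULATED_QUALITY}
    picks = {name: 0 for name in SIMULATED_QUALITY}
    for _ in range(n_steps):
        # Sample a plausible success rate per model and route to the best draw.
        draws = {m: rng.betavariate(a, b) for m, (a, b) in posterior.items()}
        chosen = max(draws, key=draws.get)
        picks[chosen] += 1
        success = rng.random() < SIMULATED_QUALITY[chosen]  # simulated outcome
        posterior[chosen][0 if success else 1] += 1
    return picks, posterior


picks, posterior = route_steps()
print(picks)
```

The sampler keeps trying every model occasionally while shifting most of the budget toward whichever one has been paying off, which is the behavior a multi-model search needs when no single model is best on every task.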

The second key component for Marlin is workflow automation from Sakana’s AI Scientist project. That project demonstrated autonomous scientific discovery and was published in Nature.

[Interactive demo: AB-MCTS “Wider or Deeper?” search. A simplified visual of Sakana AI’s Adaptive Branching Monte Carlo Tree Search with a search budget of 24 steps. At each step the policy chooses to widen (new candidate) or deepen (refine a promising line). Node color runs from low score to high score, the best path is highlighted, and a “Multi-LLM” toggle routes steps across Gemini 2.5 Pro, o4-mini, and DeepSeek-R1.]

How Marlin Compares

Marlin competes on depth, not speed. Conventional deep-research tools answer in minutes to tens of minutes. Marlin deliberately spends hours to raise output quality. The competitor run times below are approximate and reported, not official figures.


Tool | Typical run time | Output | Primary user
Sakana Marlin | Up to ~8 hours | Report (dozens to ~100 pages) + slides | Enterprise strategy teams
OpenAI Deep Research | ~Minutes to tens of minutes | Cited text report | General and pro users
Perplexity Deep Research | ~A few minutes | Cited text answer | General users
Google Gemini Deep Research | ~Minutes | Cited text report | General and workspace users

The trade-off is explicit. You wait longer and pay per run. In return you get deeper hypothesis testing and a finished deliverable. You can cancel a run anytime, but credits are still consumed.

Pricing

Sakana offers pay-as-you-go along with Pro, Team, and Enterprise tiers. Pay-as-you-go starts at 100 credits per run, at ¥98 per credit. Pro is ¥150,000 per month and includes 2,000 credits. Team is ¥400,000 per month and includes 6,000 credits. Enterprise pricing is custom, with dedicated support.
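The listed prices can be compared with a few lines of arithmetic. The sketch below assumes every run costs exactly the 100-credit minimum and that subscription credits are spent at the same rate; the article only says runs start at 100 credits, so real runs may cost more. `plan_summary` is a hypothetical helper.

```python
PAYG_YEN_PER_CREDIT = 98
MIN_CREDITS_PER_RUN = 100          # assumption: "starts at" treated as the cost of a run

# plan name -> (monthly fee in yen, included credits)
PLANS = {"Pro": (150_000, 2_000), "Team": (400_000, 6_000)}


def plan_summary(fee_yen, credits):
    """Effective yen per credit and the number of minimum-size runs the credits cover."""
    return fee_yen / credits, credits // MIN_CREDITS_PER_RUN


payg_min_run_yen = PAYG_YEN_PER_CREDIT * MIN_CREDITS_PER_RUN
print(f"Pay-as-you-go, minimum run: ¥{payg_min_run_yen:,}")
for name, (fee, credits) in PLANS.items():
    per_credit, max_runs = plan_summary(fee, credits)
    print(f"{name}: ¥{per_credit:.2f} per credit, up to {max_runs} minimum-size runs")
```

Under these assumptions a minimum pay-as-you-go run costs ¥9,800. The included credits work out to ¥75 per credit on Pro (20 minimum-size runs) and about ¥66.67 per credit on Team (60 minimum-size runs).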

Use Cases, With Examples

Marlin suits high-stakes questions where research is the bottleneck. Here are concrete examples drawn from its target tasks.

  • Market entry: ‘Assess Japan’s stablecoin and tokenized-payments market after regulatory change.’ Marlin maps drivers, risks, and structured options into a report.
  • Risk analysis: ‘Model resolution scenarios for a Strait of Hormuz blockade.’ It compares hypotheses, not just summaries, before drawing conclusions.
  • Competitive analysis: ‘Profile three rivals and rank our positioning gaps.’ It returns slides ready for a strategy review.

Each example fits one prompt and one unattended run. A human still reviews the cited output before any decision.

Try the Engine Yourself: TreeQuest

You cannot self-host Marlin. But you can run its core algorithm today. Sakana open-sourced AB-MCTS as TreeQuest under the Apache 2.0 license. Install it, define a generate function, then run a fixed search budget.

import random
import treequest as tq

# Each node holds a user-defined state; score must be normalized to [0, 1].
def generate(parent_state):
    if parent_state is None:               # None means expand from the root
        new_state = "Initial draft"
    else:
        new_state = f"Refined: {parent_state}"
    score = random.random()                # swap this for an LLM-based score
    return new_state, score

algo = tq.ABMCTSA()                         # Adaptive Branching MCTS (variant A)
search_tree = algo.init_tree()

for _ in range(10):                         # generation budget of 10
    search_tree = algo.step(search_tree, {"generate": generate})

best_state, best_score = tq.top_k(search_tree, algo, k=1)[0]
print("BEST:", best_state, round(best_score, 3))

Swap the random score for an LLM judge to reproduce the real pattern. TreeQuest also ships multi-LLM search and checkpointing for long runs. Checkpointing matters because long sessions can hit API errors midway.
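Wiring a judge into the generate function shown above might look like the following sketch. It does not import TreeQuest or call any model: `make_generate`, `stub_draft`, and `stub_judge` are hypothetical names, the stubs stand in for real LLM calls, and the 0 to 10 rating scale is an assumption. The only constraint taken from the example above is that the returned score lies in [0, 1].

```python
from typing import Callable, Optional, Tuple


def make_generate(
    draft: Callable[[Optional[str]], str],
    judge: Callable[[str], float],
    max_rating: float = 10.0,
) -> Callable[[Optional[str]], Tuple[str, float]]:
    """Build a generate(parent_state) function returning (state, score in [0, 1])."""
    def generate(parent_state: Optional[str]) -> Tuple[str, float]:
        new_state = draft(parent_state)
        rating = judge(new_state)                        # assumed 0-10 rubric rating
        score = min(1.0, max(0.0, rating / max_rating))  # clamp into [0, 1]
        return new_state, score
    return generate


# Stand-ins for real model calls; replace both with your own client code.
def stub_draft(parent_state: Optional[str]) -> str:
    return "Initial draft" if parent_state is None else f"Refined: {parent_state}"


def stub_judge(text: str) -> float:
    return min(10.0, len(text) / 5)   # placeholder rating, not a real rubric


generate = make_generate(stub_draft, stub_judge)
print(generate(None))
```

Clamping matters because a judge model can return a rating outside the expected range, and an out-of-range score would violate the [0, 1] requirement noted in the example above.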

Strengths and Weaknesses

Strengths

  • Peer-reviewed foundations: AB-MCTS at NeurIPS and AI Scientist in Nature.
  • Finished deliverables, including references, appendices, and slides.
  • Adaptive compute spends effort on the most promising branches.
  • The open-source core (TreeQuest) lets AI researchers study the method.

Weaknesses

  • Long runtimes make iteration slow versus minute-scale research tools.
  • Automated reports can contain hard-to-spot errors that need human review.
  • Pricing and design target enterprises, not individual developers.
  • Marlin itself is closed; only the underlying algorithm is open.

Key Takeaways

  • Sakana Marlin runs autonomous research for up to about eight hours per task.
  • One run produces a report of dozens of pages (up to roughly 100), plus a slide deck.
  • It builds on AB-MCTS (NeurIPS 2025 Spotlight) and AI Scientist workflows (Nature).
  • Entry pricing is pay-as-you-go: 100 credits per run at ¥98 per credit.
  • It targets finance, corporate strategy, consulting, and think-tank teams.

Sources

  • Sakana AI — Sakana Marlin release: https://sakana.ai/marlin-release/
  • Sakana AI — Sakana Marlin product page: https://sakana.ai/marlin/
  • Sakana AI — AB-MCTS research and TreeQuest: https://sakana.ai/ab-mcts/
  • SakanaAI/treequest (GitHub, Apache 2.0): https://github.com/SakanaAI/treequest



