Add `RollbackToolRegistry` in the `Persistency` feature in order to roll back tool calls with side effects when checkpointing. Make `AIAgent` state-manageable and introduce `AIAgentService` to manage multiple uniform running agents. Deprecate concurrent unsafe `AIAgent.asTool` in favor of `AIAgentService.createAgentTool` #873

Ololoshechkin · 2025-09-26T17:31:50Z

Add RollbackToolRegistry in the Persistency feature in order to roll back tool calls with side effects when checkpointing:

val agent = AIAgent(
    toolRegistry = ToolRegistry {
        tool(::createUser)
        tool(::sendMessage)
        tool(::inviteMember)
    },
    ...
) {
    install(Persistency) {
        storage = myStateStorageImpl
    
        rollbackToolRegistry = RollbackToolRegistry {
            // TOOL -> "REVERSE" TOOL
            
            registerRollback(::createUser, ::deleteUser)
            registerRollback(::sendMessage, ::undoMessage)
            registerRollback(::inviteMember, ::revokeInvitation)
        }
    }
}

Now you can also manage (create, run, find, access running state) multiple AI Agents using AIAgentService :

val agentService = AIAgentService(
    toolRegistry = ToolRegistry {
        tool(::createUser)
        tool(::sendMessage)
        tool(::inviteMember)
    },
    ...
) {
    install(Persistency) {
        storage = myStateStorageImpl
    
        rollbackToolRegistry = RollbackToolRegistry {
            registerRollback(::createUser, ::deleteUser)
            registerRollback(::sendMessage, ::undoMessage)
            registerRollback(::inviteMember, ::revokeInvitation)
        }
    }
}

And then you can create instances of AIAgent, and manage their running state. This is particularly useful to rollback long-running operations if user realizes that agent took the wrong direction:

// user creates new agent:
post("/agent") {
    val input = call.receive<String>()

    val agent = agentService.createAgent()

    launch {
        agent.run(input
    }
    
    call.respond(agent.id)
}

// user checks the agent's state:
get("/agent") {
    val id = call.receive<String>()
    val agent = agentService.agentById(id)

    if (!agent.finished()) call.respondText("Agent is still running...")
    else call.respond(agent.resultIfReady()!!)
}

// user asks running agent to rollback:
data class RollbackRequest(val agentId: String, val checkpoint: String)

put("/agent/rollback") {
    val userRequest = call.receive<RollbackRequest>()
    val agent = agentService.agentById(userRequest.agentId)

    if (agent.finished()) {
         call.respondText("Agent has already finished!")
    } else {
        // Rolling back agent to a checkpoint
        agent.withRunningContext {
            withPersistency(this) { ctx ->
                rollbackToCheckpoint(userRequest.checkpoint, ctx)
            }
        }
    
        call.respond(HttpStatusCode.OK)
    }
}

Make AIAgent explicitly single-run. Previous semantic was: AIAgent.run can be called multiple times, but if called in parallel -- throws exception that agent is currently running.
This was creating a hard to manage contract for run() and was very error-prone (see example of such errors below in 4)
Fix Agent.asTool for parallel use AIAgent.asTool() fails with parallelTools due to agent is already running. #864 . Previously -- AIAgentTool was holding an instance of AIAgent and running it. When LLM decided to call tools in parallel, because of the previous contract of AIAgent it was failing in runtime with exception. Hence, AIAgent.asTool was broken.
Now -- intended usage is AIAgentService.createAgentTool().
AIAgent.asTool is now deprecated and is currently working correctly via AIAgentService.fromAgent(this).createAgentTool() (i.e. it creates an instance of AIAgentService from the current AIAgent, then creates a tool that holds a new copy of AIAgent).

Motivation and Context

Breaking Changes

Type of the changes

New feature (non-breaking change which adds functionality)
Bug fix (non-breaking change which fixes an issue)
Breaking change (fix or feature that would cause existing functionality to change)
Documentation update
Tests improvement
Refactoring

Checklist

The pull request has a description of the proposed change
I read the Contributing Guidelines before opening the pull request
The pull request uses develop as the base branch
Tests for the changes have been added
All new and existing tests passed

Additional steps for pull requests adding a new feature

An issue describing the proposed change exists
The pull request includes a link to the issue
The change was discussed and approved in the issue
Docs have been added / updated

github-actions · 2025-09-27T20:20:17Z

Qodana for JVM

862 new problems were found

Inspection name	Severity	Problems
`Check Kotlin and Java source code coverage`	🔶 Warning	838
`Vulnerable imported dependency`	🔶 Warning	17
`Unused import directive`	🔶 Warning	6
`Missing KDoc for public API declaration`	🔶 Warning	1

@@ Code coverage @@
+ 70% total lines covered
12679 lines analyzed, 8901 lines covered
# Calculated according to the filters of your coverage tool

☁️ View the detailed Qodana report

Contact Qodana team

Contact us at qodana-support@jetbrains.com

Or via our issue tracker: https://jb.gg/qodana-issue
Or share your feedback: https://jb.gg/qodana-discussions

…back tool calls with side effects when checkpointing.

…ally needed

…DO: fix structured concurrency

…in tests

…more than once

…ool` because it's not working with parallel tools anyway

…Tool via it, fix example in docs

…tests

…ice.createAgentTool` Refs JetBrains#873

…oll back tool calls with side effects when checkpointing. Make `AIAgent` state-manageable and introduce `AIAgentService` to manage multiple uniform running agents. Deprecate concurrent unsafe `AIAgent.asTool` in favor of `AIAgentService.createAgentTool` (JetBrains#873) 1. Add `RollbackToolRegistry` in the `Persistency` feature in order to roll back tool calls with side effects when checkpointing: ```kotlin val agent = AIAgent( toolRegistry = ToolRegistry { tool(::createUser) tool(::sendMessage) tool(::inviteMember) }, ... ) { install(Persistency) { storage = myStateStorageImpl rollbackToolRegistry = RollbackToolRegistry { // TOOL -> "REVERSE" TOOL registerRollback(::createUser, ::deleteUser) registerRollback(::sendMessage, ::undoMessage) registerRollback(::inviteMember, ::revokeInvitation) } } } ``` 2. Now you can also manage (create, run, find, access running state) multiple AI Agents using `AIAgentService` : ```kotlin val agentService = AIAgentService( toolRegistry = ToolRegistry { tool(::createUser) tool(::sendMessage) tool(::inviteMember) }, ... ) { install(Persistency) { storage = myStateStorageImpl rollbackToolRegistry = RollbackToolRegistry { registerRollback(::createUser, ::deleteUser) registerRollback(::sendMessage, ::undoMessage) registerRollback(::inviteMember, ::revokeInvitation) } } } ``` And then you can create instances of AIAgent, and manage their running state. This is particularly useful to rollback long-running operations if user realizes that agent took the wrong direction: ```kotlin // user creates new agent: post("/agent") { val input = call.receive<String>() val agent = agentService.createAgent() launch { agent.run(input } call.respond(agent.id) } // user checks the agent's state: get("/agent") { val id = call.receive<String>() val agent = agentService.agentById(id) if (!agent.finished()) call.respondText("Agent is still running...") else call.respond(agent.resultIfReady()!!) } // user asks running agent to rollback: data class RollbackRequest(val agentId: String, val checkpoint: String) put("/agent/rollback") { val userRequest = call.receive<RollbackRequest>() val agent = agentService.agentById(userRequest.agentId) if (agent.finished()) { call.respondText("Agent has already finished!") } else { // Rolling back agent to a checkpoint agent.withRunningContext { withPersistency(this) { ctx -> rollbackToCheckpoint(userRequest.checkpoint, ctx) } } call.respond(HttpStatusCode.OK) } } ``` 3. Make `AIAgent` explicitly single-run. Previous semantic was: `AIAgent.run` can be called multiple times, but if called in parallel -- throws exception that agent is currently running. This was creating a hard to manage contract for `run()` and was very error-prone (see example of such errors below in `4`) 4. Fix `Agent.asTool` for parallel use JetBrains#864 . Previously -- `AIAgentTool` was holding an instance of `AIAgent` and running it. When LLM decided to call tools in parallel, because of the previous contract of `AIAgent` it was failing in runtime with exception. Hence, `AIAgent.asTool` was broken. Now -- intended usage is `AIAgentService.createAgentTool()`. `AIAgent.asTool` is now deprecated and is currently working correctly via `AIAgentService.fromAgent(this).createAgentTool()` (i.e. it creates an instance of `AIAgentService` from the current `AIAgent`, then creates a tool that holds a new copy of `AIAgent`). ## Motivation and Context  ## Breaking Changes  --- #### Type of the changes - [x] New feature (non-breaking change which adds functionality) - [ ] Bug fix (non-breaking change which fixes an issue) - [ ] Breaking change (fix or feature that would cause existing functionality to change) - [ ] Documentation update - [ ] Tests improvement - [ ] Refactoring #### Checklist - [x] The pull request has a description of the proposed change - [x] I read the [Contributing Guidelines](https://github.com/JetBrains/koog/blob/main/CONTRIBUTING.md) before opening the pull request - [x] The pull request uses **`develop`** as the base branch - [x] Tests for the changes have been added - [ ] All new and existing tests passed ##### Additional steps for pull requests adding a new feature - [ ] An issue describing the proposed change exists - [ ] The pull request includes a link to the issue - [ ] The change was discussed and approved in the issue - [x] Docs have been added / updated

Ololoshechkin requested a review from Rizzen September 26, 2025 17:31

Ololoshechkin requested a review from serge-p7v September 29, 2025 09:00

Ololoshechkin mentioned this pull request Sep 29, 2025

Updated documentation on functional agents #881

Merged

15 tasks

Ololoshechkin added 19 commits September 30, 2025 14:23

Add RollbackToolRegistry in the Persistency feature in order to roll …

a1facda

…back tool calls with side effects when checkpointing.

Add additionalRollbackActions in order to perform rollback where actu…

287cdb0

…ally needed

Fix messageHistoryDiff between current state and checkpoint

ad8ed9b

Add tests

e9a8d9a

Add JVM version for function references, and documentation

821a6b0

Fix KLint

5bf4a76

Work In Progress -- also adding sessions to agents, and add tests. TO…

6087d58

…DO: fix structured concurrency

Work In Progress -- fixed concurrencyt. TODO: fix checkpoint rollback

6aec422

Make it working !

2c3e313

Fix ktlint

2cd8f24

Rename rootContext to parentRootContext

d59c23b

Fix agent-persistency.md

d4adf1d

Fix CE

2c37352

Fix KTLint

a2d1615

Rollback AIAgentSession

b28eb98

Introduce AIAgentService, make AIAgent manageable, and rewrite logic …

d101cc4

…in tests

fix examples

979e828

Add resultIfReady and agentById

068cf37

fix KLint and example CE

35b86c4

Ololoshechkin added 14 commits September 30, 2025 14:28

Change PersistencyRunsTwiceTest and not allow running the same agent …

b6c0c02

…more than once

Use AIAgentService everywhere where multiple agent runs were required.

9f550e1

Unify Graph and Functional agents with StatefulSingleUseAIAgent

38eea99

Make all technical fields private in StatefulSingleUseAIAgent

d5708a1

Introduce AIAgentService.createAgentTool and deprecaet `AIAgent.asT…

e4b5e4a

…ool` because it's not working with parallel tools anyway

Introduce AIAGentService.fromAgent(AIAgent) and workaround AIAgent.as…

9dc6c20

…Tool via it, fix example in docs

fix typo in doc example

12ace1e

Fix AIAgentToolTest by inheriting MockAgent from GraphAIAgent

15f132e

Support multiple agents with same ID in AIAgentService, and add unit …

8e6303b

…tests

Fix KtLint

0196c73

Fix OpenTelemetryTestAPI compilation

5144510

Fix OpenTelemetryTest

82e7f16

Update API after design discussion with @Rizzen

2c6056a

Fix tests

e80d688

Rizzen approved these changes Sep 30, 2025

View reviewed changes

Ololoshechkin force-pushed the vbr/rollback-sideeffect-tools-in-persistency branch from 8a3a423 to e80d688 Compare September 30, 2025 12:34

rebase

c450e6a

Ololoshechkin merged commit eca6b46 into develop Sep 30, 2025
11 checks passed

Ololoshechkin deleted the vbr/rollback-sideeffect-tools-in-persistency branch September 30, 2025 14:05

Ololoshechkin mentioned this pull request Sep 30, 2025

AIAgent.asTool() fails with parallelTools due to agent is already running. #864

Closed

valery1707 added a commit to valery1707/koog that referenced this pull request Oct 17, 2025

Deprecate concurrent unsafe AIAgent.asTool in favor of `AIAgentServ…

1fc9300

…ice.createAgentTool` Refs JetBrains#873

valery1707 mentioned this pull request Oct 17, 2025

Deprecate concurrent unsafe AIAgent.asTool in favor of AIAgentService.createAgentTool #992

Open

15 tasks

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Uh oh!

Ololoshechkin commented Sep 26, 2025 •

edited

Loading

Uh oh!

github-actions bot commented Sep 27, 2025 •

edited

Loading

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Uh oh!

Conversation

Ololoshechkin commented Sep 26, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Motivation and Context

Breaking Changes

Type of the changes

Checklist

Additional steps for pull requests adding a new feature

Uh oh!

github-actions bot commented Sep 27, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Qodana for JVM

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Ololoshechkin commented Sep 26, 2025 •

edited

Loading

github-actions bot commented Sep 27, 2025 •

edited

Loading