
Complete LangGraph Crash Course

Levels covered: Basic → Intermediate → Advanced
Course Overview 💬

1. Levels of Autonomy in LLM applications (Code -> LLM Call -> Chain -> Router ->
Agent)

2. Understanding Agents & Tools

3. Building Agents & Tools from Scratch

4. Building Agents From pre-defined LangChain classes


Course Overview 💬

5. Graph Structure
- Directed Acyclic Graph (DAG) vs Cyclic Graph

6. What is LangGraph?

7. Why is LangGraph required?

8. Creating LangGraph from scratch

9. Creating a LangGraph using in-built classes (Reflection, Reflexion agents, etc.)

10. Key concepts and terms in LangGraph


- Graph, state, nodes, edges, visualisation, checkpoints, breakpoints,
configuration
Course Overview 💬

11. Creating a Chatbot with LangGraph

12. Common Agentic Patterns


- Human-in-the-loop
- ReAct Agent, and many more.

13. Multi-agent systems using LangGraph

14. RAGs with LangGraph: CRAG vs ARAG vs self-RAG

15. Persistence
Course Overview 💬

16. LangGraph ecosystem


- LangGraph Studio
- LangGraph Cloud API, etc

17. Agents in Production


Pre-requisites

1. You need to have Python 3.8 or higher installed


2. Basic familiarity with LangChain (as LangGraph builds on top of LangChain)
Levels of Autonomy in LLM applications

1. Code

Code has zero autonomy and is 100% deterministic

Everything is hard-coded, so it is not really a cognitive architecture at all.

Disadvantage:
The problem? You'd need to write rules for every possible scenario - making it
impossible to handle real-world complexity.
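
As a quick illustration, here is a minimal sketch (not from the course) of this level: every behaviour is a hand-written rule, so every new scenario needs yet another rule.

```python
# "Level 1" autonomy: every behaviour is hard-coded and fully deterministic.
# The function and rules below are made up purely for illustration.

def support_bot(message: str) -> str:
    message = message.lower()
    if "refund" in message:
        return "Please fill out the refund form at /refunds."
    elif "hours" in message:
        return "We are open 9am-5pm, Monday to Friday."
    else:
        # Every unanticipated input needs yet another hand-written rule.
        return "Sorry, I don't understand. Please contact support."

print(support_bot("What are your opening hours?"))
```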
Levels of Autonomy in LLM applications

2. LLM call

A single LLM call means your app basically does one main thing - you give it an
input, it processes it, and gives you back an output.

Think of chatbots that just take your message and respond, or apps that
translate text.

This was a huge leap from hard-coded rules, even though it's still pretty simple
and is only in the 2nd stage of autonomy
Levels of Autonomy in LLM applications

2. LLM call

Diagram: User Input → LLM → Output

Example User Input: You are an expert LinkedIn post writer. Write me a post on "AI Agents Taking over Content Creation"
Levels of Autonomy in LLM applications

2. LLM call


Disadvantage:
Trying to get everything done in one shot often leads to confused or mixed-up
responses - just like how a single person can't be an expert at everything.
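
A hedged sketch of a single LLM call ("level 2"), assuming the langchain-groq package and a GROQ_API_KEY environment variable; the model name is an illustrative assumption.

```python
# "Level 2" autonomy: one LLM call, one answer.
# Assumes langchain-groq is installed and GROQ_API_KEY is set;
# the model name is illustrative and may differ in your setup.
from langchain_groq import ChatGroq

llm = ChatGroq(model="llama-3.1-8b-instant")

response = llm.invoke(
    'You are an expert LinkedIn post writer. '
    'Write me a post on "AI Agents Taking over Content Creation"'
)
print(response.content)
```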
Levels of Autonomy in LLM applications

3. Chains

Think of chains like having multiple specialists instead of one generalist. Instead
of asking one AI to do everything, we break it down into steps where each AI is
really good at one thing.

Imagine a customer service chatbot: The first AI reads your complaint and
figures out exactly what product you're talking about

The second AI finds the right solution from the company's help docs, and the
third AI turns that solution into a friendly response.

Each step is simple, but together they create a much smarter system than a
single LLM call could.
Levels of Autonomy in LLM applications

3. Chains (contd.)

This is where we first started seeing AI apps that could handle more complex
tasks - not just by being smarter, but by breaking big problems into smaller,
manageable pieces.

Disadvantage:
The downside? These fixed sequences are like a rigid assembly line - they always
follow the same steps defined by the human.
Levels of Autonomy in LLM applications

3. Chains (contd.)

Diagram: the post title flows into three parallel chains (LinkedIn, Twitter, and blog post), each one being prompt template {title} → LLM → Output.

Example User Input: "AI Agents taking over Content Creation"


Levels of Autonomy in LLM applications

4. Router

Now this is where it gets interesting - routers are like smart traffic cops for your
AI. Instead of having a fixed path like in chains, the AI itself decides what steps
to take next.

Imagine a personal assistant bot: when you ask it something, it first figures out
if you need help with scheduling, research, or calculations, then routes your
request to the right tool or chain for the job.
Levels of Autonomy in LLM applications

4. Router

Diagram: the user input goes to a Router (an LLM classifier), which sends it to the LinkedIn chain, the Twitter chain, or the blog post chain; each chain is prompt template {title} → LLM → Output.

Example User Input: Write me a LinkedIn post on "AI Agents Taking over Content Creation"
Levels of Autonomy in LLM applications

4. Router


Disadvantage:
While it can choose different paths, it still can't remember previous
conversations or learn from mistakes.
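
A hedged sketch of a router ("level 4"): an LLM classifier picks one of several chains, and plain Python dispatches to it. All prompts, chain names, and the model are illustrative assumptions.

```python
# A router: an LLM classifier decides which chain handles the request.
from langchain_core.prompts import ChatPromptTemplate
from langchain_core.output_parsers import StrOutputParser
from langchain_groq import ChatGroq

llm = ChatGroq(model="llama-3.1-8b-instant")
parser = StrOutputParser()

def make_chain(platform: str):
    prompt = ChatPromptTemplate.from_template(f"Write a {platform} post about: {{title}}")
    return prompt | llm | parser

chains = {p: make_chain(p) for p in ("linkedin", "twitter", "blog")}

classifier = (
    ChatPromptTemplate.from_template(
        "Classify this request as exactly one of: linkedin, twitter, blog.\nRequest: {input}"
    )
    | llm
    | parser
)

def route(user_input: str) -> str:
    label = classifier.invoke({"input": user_input}).strip().lower()
    # Fall back to the blog chain if the classifier answers something unexpected.
    return chains.get(label, chains["blog"]).invoke({"title": user_input})

print(route('Write me a LinkedIn post on "AI Agents Taking over Content Creation"'))
```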
Levels of Autonomy in LLM applications

5. State Machine (Agent)

This combines the previous level (router) with loops.


Agent ~= control flow controlled by an LLM

This involves features like:


1. Ability to have a human in the loop, e.g. ask for approval before moving on
2. Multi-agent systems
3. Advanced memory management
4. Go back in history and explore better alternate paths
5. Adaptive Learning

And many more, and THIS is where LangGraph comes into the picture
Levels of Autonomy in LLM applications

5. State Machine

Diagram: the user input goes to a Head of Content agent, which routes work to a LinkedIn script writer agent (with a human approval step), a blog post writer agent, and a social media publisher agent that can call Tool 1, Tool 2, and Tool 3.

Example User Input: Write me a LinkedIn post on "AI Agents Taking over Content Creation"
Levels of Autonomy in LLM applications

Chain/Router vs Agent

A chain, or even a router, is one-directional. Hence, it is not an agent.

Whereas in a state machine, we can go back in the chain, have cycles, and the flow is controlled by the LLM; hence it is called an agent.
Understanding AI Agents
AI Agents & Tools

Think of Agents as the "problem-solvers" of the AI world. Agents are capable of thinking on their own.

In other words, it's AI that can make autonomous decisions.

In the case of Chains and Routers, they follow our specific instructions.

But with agents, they actually take it a step further. They can decide for
themselves what steps to take on their own.
AI Agents & Tools

What are tools then?

Tools are specific functions that Agents can use to complete tasks

Just like a chef's kitchen tools (knife for cutting, oven for baking, blender for
mixing), tools are the special abilities we give to AI - like giving it a calculator
tool, or a search engine tool, or a calendar tool
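
For example, a tool can be defined with LangChain's @tool decorator. The toy calculator below is an illustration, not part of the course material.

```python
# Defining a tool: a plain function the agent can call.
from langchain_core.tools import tool

@tool
def multiply(a: int, b: int) -> int:
    """Multiply two integers and return the result."""
    return a * b

# The tool exposes a name, description, and argument schema the agent can read.
print(multiply.name, multiply.description)
print(multiply.invoke({"a": 6, "b": 7}))
```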
AI Agents & Tools (Re-Act Agent Pattern)

This is one of the best known patterns in AI today to build agents. It stands for
Reasoning + Acting

This is basically a concept that mimics how human beings think.


ReACT pattern

Think: LLM first thinks about the user prompt/problem


Action: LLM decides if it can answer by itself or if it should use a tool
Action Input: LLM provides the input argument for the tool
Here, LangChain executes the tool and returns the output to the LLM

Observe: LLM observes the result of the tool

Final Answer: "This is your final answer"


ReACT pattern

Re-Act Agent

Diagram: the Re-Act agent is an LLM connected to its Tools in a loop.
Let's Jump Into The Code

1. Build a Re-act Agent Using LangChain

2. What are its drawbacks and where does LangGraph come into the picture?
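
As a reference point before the walkthrough, here is a hedged sketch of a ReAct agent built with LangChain's prebuilt helpers. It assumes the langchain, langchainhub and langchain-groq packages, a GROQ_API_KEY, and an illustrative tool; the course's actual code may differ.

```python
# A ReAct agent using LangChain's create_react_agent + AgentExecutor.
from langchain import hub
from langchain.agents import AgentExecutor, create_react_agent
from langchain_core.tools import tool
from langchain_groq import ChatGroq

@tool
def get_word_length(word: str) -> int:
    """Return the number of characters in a word."""
    return len(word)

llm = ChatGroq(model="llama-3.1-8b-instant")
tools = [get_word_length]

# The classic ReAct prompt with Thought / Action / Observation placeholders.
prompt = hub.pull("hwchase17/react")

agent = create_react_agent(llm, tools, prompt)
executor = AgentExecutor(agent=agent, tools=tools, verbose=True)

print(executor.invoke({"input": "How many characters are in 'LangGraph'?"}))
```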
ReAct Agents

ReAct Agents are flexible, i.e., any sequence of steps is possible


Diagram: possible paths include Start → tool 1 → End, Start → tool 2 → End, Start → tool 1 → tool 2 → End, and Start → tool 2 → tool 1 → End.
ReAct Agents

But high flexibility can also mean less reliability

Diagram: Start → tool 1 → tool 1 → tool 1 → tool 1 → ... (an infinite loop)

Infinite Loop causes:


1. We did not define the tools correctly
2. The LLM is not capable enough
3. The prompting doesn't define a clear end condition
Best Of Both Worlds

Chain (Start → tool 1 → tool 2 → End): not flexible, but more reliable.

React Agent (Tools ↔ LLM loop): flexible, but less reliable.
Best Of Both Worlds

Chain (Start → tool 1 → tool 2 → End): not flexible, more reliable.

LangGraph: flexible and reliable.

React Agent (Tools ↔ LLM loop): flexible, less reliable.
What is LangGraph?

A framework for building controllable, persistent agent workflows with built-in support for human interaction, streaming, and state management.

It uses the Graph Data Structure to achieve this.
Key Features Of LangGraph

1. Looping and Branching Capabilities:


Supports conditional statements and loop structures, allowing dynamic execution
paths based on state.

2. State Persistence:
Automatically saves and manages state, supporting pause and resume for
long-running conversations.

3. Human-Machine Interaction Support:


Allows inserting human review during execution, supporting state editing and
modification with flexible interaction control mechanisms.
Key Features Of LangGraph

4. Streaming Processing:
Supports streaming output and real-time feedback on execution status to
enhance user experience.

5. Seamless Integration with LangChain:


Reuses existing LangChain components, supports LCEL expressions, and offers
rich tool and model support.
Why use the Graph Data Structure?
Core Components of LangGraph
Core Components of LangGraph

1. Nodes

2. Edges

3. Conditional Edges

4. State
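
A minimal sketch showing all four components together; the state schema and node names are invented for illustration.

```python
# State, nodes, edges, and conditional edges in one small graph.
from typing import TypedDict
from langgraph.graph import StateGraph, START, END

class State(TypedDict):          # 4. State: the data carried through the graph
    topic: str
    draft: str

def write_draft(state: State) -> dict:   # 1. Node: a function that updates state
    return {"draft": f"A post about {state['topic']}"}

def review(state: State) -> str:         # 3. Conditional edge: routes based on state
    return END if state["draft"] else "write_draft"

builder = StateGraph(State)
builder.add_node("write_draft", write_draft)
builder.add_edge(START, "write_draft")          # 2. Edge: a fixed transition
builder.add_conditional_edges("write_draft", review)

graph = builder.compile()
print(graph.invoke({"topic": "AI agents", "draft": ""}))
```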
Example: Reflection Agent pattern

Diagram: _start_ → Generate tweet (LLM) → Criticize tweet (LLM) → back to Generate tweet, or _END_
Reflection Agents in LangGraph
Reflection Agents in LangGraph

1. What is a Reflection Agent System?


2. Three types of Reflection Agent Systems
3. Setup & Installations
4. Implement a reflection Agent System
Reflection Agent pattern in LangGraph

But what does the English word "reflection" mean?

Like how you're looking at your reflection in the mirror, reflection means
looking at yourself or your actions

For example:

- After giving a presentation, thinking about how it went


- After writing an email, reading it again to check if it's clear
- After making a decision, considering if it was the right choice
Reflection Agent pattern in LangGraph

So what is a reflection-agent pattern?

A reflection agent pattern is an AI system pattern in which the system looks at its own outputs, thinks about them, and improves them - just like how we look at ourselves in a mirror, self-reflect, and make ourselves better.

A basic reflection agent system typically consists of:

1. A generator agent
2. A reflector agent
Example: Basic Reflection Agent pattern

Diagram: _start_ → Tweet generation agent → Tweet critique agent → back to the generation agent, or _END_
Reflection Agent pattern in LangGraph
Types of Reflection Agents in LangGraph

There are 3 types:

1. Basic Reflection Agents


2. Reflexion Agents
3. Language Agent Tree Search (LATS)
Let's Implement a Basic Reflection Agent!
Let's Implement a Basic Reflection Agent!

In this section, we'll build:


1. generation_chain
2. reflect_chain
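
A hedged sketch of what these two chains might look like; the prompt wording and model are assumptions, not the course's exact code.

```python
# The two chains of a basic reflection agent: a generator and a reflector.
from langchain_core.prompts import ChatPromptTemplate, MessagesPlaceholder
from langchain_groq import ChatGroq

llm = ChatGroq(model="llama-3.1-8b-instant")

generation_prompt = ChatPromptTemplate.from_messages([
    ("system", "You are a Twitter influencer. Write the best tweet you can "
               "for the user's request. Revise it if you receive critique."),
    MessagesPlaceholder(variable_name="messages"),
])

reflection_prompt = ChatPromptTemplate.from_messages([
    ("system", "You are a viral-content critic. Grade the tweet and give "
               "specific recommendations to improve it."),
    MessagesPlaceholder(variable_name="messages"),
])

generation_chain = generation_prompt | llm
reflect_chain = reflection_prompt | llm
```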
Basic Reflection Agent!

What is a MessageGraph?

It is a class that LangGraph provides that we can use to orchestrate the flow of messages between different nodes.

Example use cases: simple routing decisions, simple chatbot conversation flow.

If you just want to pass messages along between nodes, then go for MessageGraph.

If the app requires complex state management, we have StateGraph (more on this later).
Basic Reflection Agent!

What is a MessageGraph?

To put it simply, MessageGraph maintains a list of messages and decides the flow of those messages between nodes.

Every node in MessageGraph receives the full list of previous messages as input.

Each node can append new messages to the list and return it.

The updated message list is then passed to the next node.
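
A hedged sketch wiring the generation and reflection chains from the previous sketch into a MessageGraph (importable from langgraph.graph, though newer releases may steer you towards StateGraph). The node names and stop condition are assumptions.

```python
# Orchestrating generation_chain and reflect_chain with a MessageGraph.
from langchain_core.messages import HumanMessage
from langgraph.graph import END, MessageGraph

def generate_node(messages):
    # Every node receives the full message list and returns messages to append.
    return generation_chain.invoke({"messages": messages})

def reflect_node(messages):
    critique = reflect_chain.invoke({"messages": messages})
    # Feed the critique back as if it came from the user.
    return HumanMessage(content=critique.content)

def should_continue(messages):
    # Stop after a few rounds of generate -> reflect.
    return END if len(messages) > 6 else "reflect"

builder = MessageGraph()
builder.add_node("generate", generate_node)
builder.add_node("reflect", reflect_node)
builder.set_entry_point("generate")
builder.add_conditional_edges("generate", should_continue)
builder.add_edge("reflect", "generate")

graph = builder.compile()
result = graph.invoke(HumanMessage(content="Write a tweet about LangGraph"))
```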
Reflexion Agent System
Reflexion Agents in LangGraph

Recap of what we saw previously:

Reflection Agent System consists of a generator and a reflector component

Although iteratively making a post better is significantly better than just prompting ChatGPT, the content generated is still not grounded in live data.

It could be hallucinated or outdated content, and we have no way of knowing.

The Reflexion Agent System addresses this exact drawback.


Reflexion Agents in LangGraph

What is Reflexion Agent System:

The Reflexion agent, similar to the reflection agent, not only critiques its own responses but also fact-checks them with external data by making API calls (internet search).

In the Reflection agent pattern, we had to rely on the training data of LLMs but in
this case, we're not limited to that.
Reflexion Agents in LangGraph

What is Reflexion Agent System:

The main component of Reflexion Agent System is the "actor"


The "actor" is the main agent that drives everything - it reflects on it's responses
and re-executes.
It can do this with or without tools to improve based on self-critique that is
grounded in external data

It's main sub-components include:


1. Tools/tool execution
2. Initial responder: generate an initial response & self-reflection
3. Revisor: re-respond & reflect based on previous reflections
Reflexion Agents in LangGraph

Episodic memory

In the context of Reflexion agents, episodic memory refers to an agent's ability to recall specific past interactions, events, or experiences, rather than just generalized knowledge.

This is crucial for making agents feel more context-aware, personalized, and
human-like over time.
Reflexion Agent System
Let's Implement a Reflexion Agent!
LLM Response Parser System

The system converts unstructured LLM outputs into well-defined Python objects
through a series of structured parsing steps, ensuring data validation and
consistent formatting.

What are the key components?

1. Chat Prompt Template


2. Function Calling with Pydantic Schema
3. Pydantic Parser
LLM Response Parser System

2. Function Calling with Pydantic Schema

Function calling:
Similar to how we make tools available to the LLM, we can also send a schema to the LLM and force it to structure its JSON output according to the schema

Pydantic:
A Python library that defines data structures using classes
Provides automatic validation of JSON data against these class definitions
LLM Response Parser System

3. Pydantic Parser

Takes the JSON output from the LLM's function call


Validates it against the defined Pydantic schema (class definition)
Creates instances of Pydantic classes with the validated data
If the LLM's output does not match the defined schema, it will throw an error
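
A hedged sketch of the idea using a Pydantic schema plus LangChain's PydanticOutputParser (a close relative of the function-calling approach described above); the schema fields and prompt are illustrative assumptions.

```python
# Forcing structured output with a Pydantic schema and validating it with a parser.
from pydantic import BaseModel, Field
from langchain_core.output_parsers import PydanticOutputParser
from langchain_core.prompts import ChatPromptTemplate
from langchain_groq import ChatGroq

class AnswerWithReflection(BaseModel):
    """The shape we force the LLM's JSON output into."""
    answer: str = Field(description="The answer to the user's question")
    critique: str = Field(description="A self-critique of the answer")

parser = PydanticOutputParser(pydantic_object=AnswerWithReflection)

prompt = ChatPromptTemplate.from_messages([
    ("system", "Answer the question, then critique your own answer.\n"
               "{format_instructions}"),
    ("human", "{question}"),
]).partial(format_instructions=parser.get_format_instructions())

llm = ChatGroq(model="llama-3.1-8b-instant")
chain = prompt | llm | parser   # the parser validates the JSON and builds the object

result = chain.invoke({"question": "What is LangGraph?"})
print(result.answer, "|", result.critique)
```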
Re-Act Agent using LangGraph
Re-Act Agent using LangGraph

Think: LLM first thinks about the user prompt/problem


Action: LLM decides if it can answer by itself or if it should use a tool
Action Input: LLM provides the input argument for the tool
Here, LangChain executes the tool and returns the output to the LLM

Observe: LLM observes the result of the tool

Final Answer: "This is your final answer"


ReAct Agent

LangChain:

In LangChain, we used initialize_agent as an all-in-one solution.

It combines two key components:


1. create_react_agent
2. AgentExecutor (We will eliminate the need for this in LangGraph)
ReAct Agent

1. create_react_agent: (one that creates the agent)

Takes each tool's name and description


Formats them into a standardized way the LLM can understand
Inserts them into specific placeholders in the ReAct prompt template
It makes the LLM call, takes the LLM's response, and parses it

It parses the response of the LLM into one of these two classes: AgentAction or
AgentFinish
Re-Act Agent using LangGraph

AgentAction:

This is a LangChain class that represents an action the agent wants to take. It typically contains the name of the tool to call, the input to pass to that tool, and a log of the agent's reasoning.
Re-Act Agent using LangGraph

AgentFinish:

This represents the agent completing its task with a final answer. It typically contains the return values (including the final answer) and a log of the agent's reasoning.
ReAct Agent

2. AgentExecutor:

Takes the agent from create_react_agent and manages the execution loop
Receives the user's question and feeds it to the agent
Identifies which tool to run based on the agent's output (AgentAction or No tool
if AgentFinish)
Executes the tool and captures the result
Feeds the result back to the agent for the next decision
Continues this loop until the agent produces an AgentFinish
Returns the final answer to the user
ReAct Agent - LangGraph

Key Advantage:

LangGraph turns the hidden "black box" loop into a visible, editable workflow

You can now add custom nodes, modify the flow, or insert additional logic
Re-Act Agent using LangGraph

The "reason" node does what create_react_agent did -


it thinks and decides
Start
If the reason node outputs an AgentAction, then "act"
node executes the tool

Results from the tool flow back to "reason" node for the
reason node
next decision

When the agent has the final answer, it takes the right
path to "end"
act node End
This visualization makes the "black box" of
AgentExecutor transparent and modifiable
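
A hedged sketch of this graph using LangGraph's prebuilt ToolNode and tools_condition helpers; the tool, model, and state schema are illustrative assumptions rather than the course's exact code.

```python
# The ReAct loop expressed as an explicit LangGraph: reason -> act -> reason -> end.
from typing import Annotated, TypedDict
from langchain_core.tools import tool
from langchain_groq import ChatGroq
from langgraph.graph import StateGraph, START, END
from langgraph.graph.message import add_messages
from langgraph.prebuilt import ToolNode, tools_condition

@tool
def get_word_length(word: str) -> int:
    """Return the number of characters in a word."""
    return len(word)

class AgentState(TypedDict):
    messages: Annotated[list, add_messages]

tools = [get_word_length]
llm = ChatGroq(model="llama-3.1-8b-instant").bind_tools(tools)

def reason(state: AgentState) -> dict:
    # The "reason" node: the LLM thinks and decides whether to call a tool.
    return {"messages": [llm.invoke(state["messages"])]}

builder = StateGraph(AgentState)
builder.add_node("reason", reason)
builder.add_node("act", ToolNode(tools))          # the "act" node runs the tool
builder.add_edge(START, "reason")
builder.add_conditional_edges("reason", tools_condition, {"tools": "act", END: END})
builder.add_edge("act", "reason")                 # results flow back for the next decision

graph = builder.compile()
graph.invoke({"messages": [("user", "How many characters are in 'LangGraph'?")]})
```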
State in LangGraph - Deep Dive

What is State in LangGraph?

State in LangGraph is a way to maintain and track information as an AI system processes data.

Think of it as the system's memory, allowing it to remember and update information as it moves through different stages of a workflow, or graph.
State in LangGraph - Deep Dive

Concepts we will learn:

1. What is StateGraph?

2. Basic State Structures

3. Complex State Structures

4. Manual State Transformation

5. Declarative Annotated State Transformation


State in LangGraph - Deep Dive

1. Basic State Definition

Diagram: initial state → Start → increment → should_continue → back to increment, or End.
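
A minimal sketch matching the diagram above, with assumed details (counter limit, key names).

```python
# A basic state: a counter, an "increment" node, and a "should_continue" conditional edge.
from typing import TypedDict
from langgraph.graph import StateGraph, START, END

class CounterState(TypedDict):
    count: int

def increment(state: CounterState) -> dict:
    return {"count": state["count"] + 1}

def should_continue(state: CounterState) -> str:
    # Loop back to "increment" until the counter reaches 5, then end.
    return "increment" if state["count"] < 5 else END

builder = StateGraph(CounterState)
builder.add_node("increment", increment)
builder.add_edge(START, "increment")
builder.add_conditional_edges("increment", should_continue)

graph = builder.compile()
print(graph.invoke({"count": 0}))   # {'count': 5}
```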
ReAct Agent in LangGraph - Deep Dive

States to keep track of:

1. Input that the user provided

2. The parsed output of the LLM response (AgentAction or AgentFinish)

3. The history that has taken place so far
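
A hedged sketch of a state schema tracking these three things; the exact class names follow common LangGraph examples rather than this course's code.

```python
# A state schema for the ReAct graph: input, parsed outcome, and history.
import operator
from typing import Annotated, List, Tuple, TypedDict, Union
from langchain_core.agents import AgentAction, AgentFinish

class AgentState(TypedDict):
    input: str                                             # 1. the user's input
    agent_outcome: Union[AgentAction, AgentFinish, None]   # 2. parsed LLM output
    # 3. history so far; operator.add appends new steps instead of overwriting.
    intermediate_steps: Annotated[List[Tuple[AgentAction, str]], operator.add]
```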


Chatbots using LangGraph

What we'll cover in the next few sections:

1. Basic Chatbot (no memory)


2. Chatbot with Tools
3. Chatbot with memory
4. Chatbot with human-in-loop scenarios
5. Chatbot with more complex state
6. Understanding Time-travel
Chat bots using LangGraph

1. Basic Chatbot

- No memory
- No tools

We'll learn:

- Graph stream method


- Chat looping
- Use free Llama Model using the Groq interface
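
A hedged sketch of such a basic chatbot, assuming the langchain-groq package, a GROQ_API_KEY, and an illustrative model name.

```python
# A basic chatbot: one LLM node, no tools, no memory, plus a chat loop using stream.
from typing import Annotated, TypedDict
from langchain_groq import ChatGroq
from langgraph.graph import StateGraph, START, END
from langgraph.graph.message import add_messages

class ChatState(TypedDict):
    messages: Annotated[list, add_messages]

llm = ChatGroq(model="llama-3.1-8b-instant")

def chatbot(state: ChatState) -> dict:
    return {"messages": [llm.invoke(state["messages"])]}

builder = StateGraph(ChatState)
builder.add_node("chatbot", chatbot)
builder.add_edge(START, "chatbot")
builder.add_edge("chatbot", END)
graph = builder.compile()

# Chat looping with the graph's stream method.
while True:
    user_input = input("You: ")
    if user_input.lower() in ("quit", "exit"):
        break
    for event in graph.stream({"messages": [("user", user_input)]}):
        for value in event.values():
            print("Bot:", value["messages"][-1].content)
```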
Chat bots using LangGraph

2. Chatbot with Tools

- Adding a tool call ability


Memory & Checkpointers

When you build a basic chatbot using LangGraph, you run into an immediate
limitation: by default, your chatbot has amnesia.

Every time a user starts a conversation, the bot has no recollection of previous
interactions.
This happens because, without memory management, each invocation of your graph is completely independent.
This is where the concept of checkpointers in LangGraph comes into the picture.
Memory & Checkpointers

What is a Checkpointer?

A checkpointer in LangGraph is essentially a way to save the state of your agent or workflow at specific points during execution.

Think of it like saving your progress in a video game. When you reach a
checkpoint:
1. The current state of everything is saved
2. If something goes wrong later, you can return to this saved point
3. You don't have to start over from the beginning
Memory & Checkpointers

What is a Checkpointer? (contd.)

In the context of LangGraph nodes and workflows:

● Nodes are the individual steps or components in your workflow


● Checkpoints save the complete state after a node finishes its work
● If an error occurs in a later node, you can resume from the last checkpoint rather than starting
the entire workflow again

This is particularly useful for complex workflows where:

● Processing takes significant time or resources


● You want to implement retry mechanisms
● You need persistence across sessions or server restarts
Memory & Checkpointers

Thread ID:

A thread ID is simply a unique identifier for each specific conversation or workflow execution. Think
of it like:
- A unique session ID for a user
- A conversation ID that groups related messages together

The thread ID is necessary because:

1. You might have multiple conversations/workflows running simultaneously

2. Each needs its own separate saved state

3. The thread ID helps the system know which saved state belongs to which conversation

Without thread IDs, all your conversations would share the same state, which would cause confusion
and errors.
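
A hedged sketch of adding a checkpointer and a thread ID, reusing the `builder` from the chatbot sketch above; MemorySaver and the thread ID value are illustrative choices.

```python
# Adding memory: compile with a checkpointer and pass a thread_id per conversation.
from langgraph.checkpoint.memory import MemorySaver

memory = MemorySaver()                       # in-memory checkpointer (use a DB in production)
graph = builder.compile(checkpointer=memory)

# The thread_id groups all checkpoints belonging to one conversation.
config = {"configurable": {"thread_id": "user-42"}}

graph.invoke({"messages": [("user", "Hi, my name is Alice.")]}, config)
reply = graph.invoke({"messages": [("user", "What is my name?")]}, config)
print(reply["messages"][-1].content)         # the bot can now recall "Alice"
```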
Human In The Loop

A human-in-the-loop workflow integrates human input into automated processes, allowing for decisions, validation, or corrections at key stages.

This is especially useful in LLM-based applications, where the underlying model may
generate occasional inaccuracies.

Use-cases:

1. Reviewing tool calls: Humans can review, edit, or approve tool calls requested by the LLM
before tool execution.

2. Validating LLM outputs: Humans can review, edit, or approve content generated by the
LLM

3. Providing context: Enable the LLM to explicitly request human input for clarification or
additional details or to support multi-turn conversations.
Human In The Loop (Design Patterns)

There are typically three different actions that you can do with a human-in-the-loop
workflow:

1. Approve or Reject:

Pause the graph before a critical step, such as an API call, to review and approve the action.

If the action is rejected, you can prevent the graph from executing the step, and potentially take an alternative action. This pattern often involves routing the graph based on the human's input.
Human In The Loop (Design Patterns)

There are typically three different actions that you can do with a human-in-the-loop
workflow:

1. Approve or Reject:

Depending on the human's approval or rejection, the graph can proceed with the action or take an alternative path.
Human In The Loop (Design Patterns)

2. Review & Edit State:

A human can review and edit the state of the graph. This is useful for
correcting mistakes or updating the state with additional information.
Human In The Loop (Design Patterns)

3. Review Tool Calls:

A human can review and edit the output from the LLM before proceeding.

This is particularly critical in applications where the tool calls requested by the LLM
may be sensitive or require human oversight.
Human In The Loop (Design Patterns)

Diagram: Start → generate_post → collect_feedback (a human reviews the post) → END
Human In The Loop

Drawbacks of input():

● Freezes your program completely until someone types something

● Only works in terminals - useless for web apps

● If your program crashes, all progress is lost

● Can only handle one user at a time

● Lives only in your terminal session

This is why we use a special method that LangGraph provides called "interrupt"
Human In The Loop

What is interrupt() & Why Use It?:

● Special LangGraph function that pauses your workflow nicely

● Saves your program's state so it can continue later

● Works in web apps, APIs, and other interfaces

● Handles multiple users/sessions at once

● Survives program crashes and restarts

● Lets humans take their time to respond

● Required for any serious human-in-the-loop system


Human In The Loop

Two ways of using interrupts:

1. Interrupts set in the compile step
2. The interrupt() function with the Command class
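
A hedged sketch of the interrupt() function resumed with Command(resume=...); the node names, state schema, and review flow are assumptions.

```python
# Pausing a graph with interrupt() and resuming it with Command(resume=...).
from typing import TypedDict
from langgraph.checkpoint.memory import MemorySaver
from langgraph.graph import StateGraph, START, END
from langgraph.types import Command, interrupt

class State(TypedDict):
    draft: str
    approved_draft: str

def generate_post(state: State) -> dict:
    return {"draft": "Draft LinkedIn post about AI agents..."}

def human_review(state: State) -> dict:
    # Execution pauses here; the value passed to interrupt() is surfaced
    # to the caller so a human can review or edit the draft.
    edited = interrupt({"draft_for_review": state["draft"]})
    return {"approved_draft": edited}

builder = StateGraph(State)
builder.add_node("generate_post", generate_post)
builder.add_node("human_review", human_review)
builder.add_edge(START, "generate_post")
builder.add_edge("generate_post", "human_review")
builder.add_edge("human_review", END)

graph = builder.compile(checkpointer=MemorySaver())   # interrupts need a checkpointer
config = {"configurable": {"thread_id": "review-1"}}

graph.invoke({"draft": "", "approved_draft": ""}, config)           # pauses at interrupt()
result = graph.invoke(Command(resume="Edited post text"), config)   # the human responds
print(result["approved_draft"])
```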
Command Class

The Command class in LangGraph allows us to create edgeless workflows


Command Class

Diagram: START → node_a → node_b → node_c → END (a graph wired with explicit edges)
Command Class

Diagram: START → node_a → node_b → node_c or node_d → END (the branching is decided inside the nodes via Command, without explicit edges)
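
A hedged sketch of an edgeless workflow where each node returns a Command naming the next node; the node names and routing rule are invented.

```python
# Each node returns a Command that both updates state and picks the next node.
from typing import Literal, TypedDict
from langgraph.graph import StateGraph, START, END
from langgraph.types import Command

class State(TypedDict):
    value: int

def node_a(state: State) -> Command[Literal["node_b", "node_c"]]:
    # Decide the next node here instead of using add_edge / conditional edges.
    goto = "node_b" if state["value"] % 2 == 0 else "node_c"
    return Command(update={"value": state["value"] + 1}, goto=goto)

def node_b(state: State) -> Command[Literal["__end__"]]:
    return Command(update={"value": state["value"] * 10}, goto=END)

def node_c(state: State) -> Command[Literal["__end__"]]:
    return Command(update={"value": state["value"] * 100}, goto=END)

builder = StateGraph(State)
builder.add_node("node_a", node_a)
builder.add_node("node_b", node_b)
builder.add_node("node_c", node_c)
builder.add_edge(START, "node_a")    # only the entry edge is declared explicitly

graph = builder.compile()
print(graph.invoke({"value": 2}))    # 2 -> node_a -> node_b -> {'value': 30}
```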
Interrupts

Operations with Interrupts:

1. Resume - Continue execution with input from the user without modifying the state

2. Update and Resume - Update the state and then continue execution

3. Rewind/time Travel - Go back to a previous checkpoint in the execution

4. Branch - Create a new branch from the current execution state to explore alternative
paths

5. Abort - Cancel the current execution entirely

Each of these operations gives you different ways to control the flow of your graph when it's
interrupted
Structured Outputs

It is often useful to have a model return output that matches a specific schema that we
define

We have options to get outputs in formats such as - JSON, Dictionary, string, YAML, HTML
Structured Outputs

Pydantic Models for Structured Outputs:

1. Pydantic is a Python library that helps define data structures

2. Acts like a "blueprint" for data

3. Uses Python's type hints (like str, int) to enforce correct data types

How it works in LangChain/LangGraph:

1. Define a class with the fields you need (name, capital, language)

2. Add descriptions to explain what each field means

3. Use with_structured_output() to tell the LLM to follow your format
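
A hedged sketch using exactly those three fields; the model name is an assumption and the model must support tool/function calling.

```python
# Structured output: the LLM's reply is validated into a Country instance.
from pydantic import BaseModel, Field
from langchain_groq import ChatGroq

class Country(BaseModel):
    """Blueprint for the data we want back."""
    name: str = Field(description="Name of the country")
    capital: str = Field(description="Capital city")
    language: str = Field(description="Primary language spoken")

llm = ChatGroq(model="llama-3.1-8b-instant")
structured_llm = llm.with_structured_output(Country)

country = structured_llm.invoke("Tell me about France")
print(country.name, country.capital, country.language)   # a validated Country object
```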


Retrieval Augmented Generation (RAG) System

1. Knowledge-base construction:

Diagram: Source Documents (~1M tokens) → Chunking → chunks (~1K tokens each) → embedding vectors → Vector DB

2. Query Processing:

Diagram: Question → query embedding → Retriever → matching chunks → the chunk texts are placed into the LLM prompt as context
Classification-Driven Retrieval System
Advanced multi-step RAG System
Advanced multi-step RAG System

question_rewriter node significance:

Initial query: "What are Peak Performance Gym's hours?"


Follow-up query: "What about weekends?"

Without rephrasing, the follow-up query lacks context on its own and would likely return
irrelevant results if sent directly to a retrieval system.

The rephrasing node transforms "What about weekends?" into "What are Peak Performance
Gym's weekend hours?"
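
A hedged sketch of such a question_rewriter chain; the prompt wording, model, and example history are assumptions.

```python
# A question_rewriter: turn a follow-up into a standalone question using the chat history.
from langchain_core.output_parsers import StrOutputParser
from langchain_core.prompts import ChatPromptTemplate, MessagesPlaceholder
from langchain_groq import ChatGroq

llm = ChatGroq(model="llama-3.1-8b-instant")

rewrite_prompt = ChatPromptTemplate.from_messages([
    ("system", "Rephrase the user's latest question into a standalone question "
               "that makes sense without the chat history."),
    MessagesPlaceholder(variable_name="history"),
    ("human", "{question}"),
])

question_rewriter = rewrite_prompt | llm | StrOutputParser()

standalone = question_rewriter.invoke({
    "history": [("human", "What are Peak Performance Gym's hours?"),
                ("ai", "We are open 6am-10pm on weekdays.")],
    "question": "What about weekends?",
})
print(standalone)   # e.g. "What are Peak Performance Gym's weekend hours?"
```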
Multi-Agent Architectures

We know that an agent is a system that uses an LLM to decide the control flow of an
application.

As you develop these systems, they might grow more complex over time, making them
harder to manage and scale.

For example, you might run into the following problems:

● Agent has too many tools at its disposal and makes poor decisions about which tool to
call next
● Context grows too complex for a single agent to keep track of

● There is a need for multiple specialization areas in the system (e.g. planner, researcher,
math expert, etc.)

To tackle these, you might consider breaking your application into multiple smaller,
independent agents and composing them into a multi-agent system.
Multi-Agent Architectures

These independent agents can be as simple as a prompt and an LLM call, or as complex as a
ReAct agent

The primary benefits of using multi-agent systems are:

● Modularity: Separate agents make it easier to develop, test, and maintain agentic
systems.

● Specialization: You can create expert agents focused on specific domains, which helps
with the overall system performance.

● Control: You can explicitly control how agents communicate (as opposed to relying on
function calling).
Multi-Agent Architectures
Subgraphs

Subgraphs allow you to build complex systems with multiple components that are themselves
graphs. A common use case for using subgraphs is building multi-agent systems.

The main question when adding subgraphs is how the parent graph and subgraph
communicate, i.e. how they pass the state between each other during the graph execution.

There are two scenarios:

● parent graph and subgraph share schema keys. In this case, you can add a node with
the compiled subgraph

● parent graph and subgraph have different schemas. In this case, you have to add a
node function that invokes the subgraph: this is useful when the parent graph and the
subgraph have different state schemas and you need to transform state before or after
calling the subgraph
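
A hedged sketch of the first scenario (shared schema keys), where the compiled subgraph is added directly as a node; all names are invented.

```python
# Parent graph and subgraph share the "text" key, so the compiled subgraph is a node.
from typing import TypedDict
from langgraph.graph import StateGraph, START, END

class SharedState(TypedDict):
    text: str

# --- the subgraph ---
def shout(state: SharedState) -> dict:
    return {"text": state["text"].upper()}

sub_builder = StateGraph(SharedState)
sub_builder.add_node("shout", shout)
sub_builder.add_edge(START, "shout")
sub_builder.add_edge("shout", END)
subgraph = sub_builder.compile()

# --- the parent graph: the compiled subgraph is just another node ---
def greet(state: SharedState) -> dict:
    return {"text": "hello " + state["text"]}

parent = StateGraph(SharedState)
parent.add_node("greet", greet)
parent.add_node("shout_subgraph", subgraph)
parent.add_edge(START, "greet")
parent.add_edge("greet", "shout_subgraph")
parent.add_edge("shout_subgraph", END)

graph = parent.compile()
print(graph.invoke({"text": "world"}))   # {'text': 'HELLO WORLD'}
```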
Supervisor Multi-agent Architecture

In this architecture, we define agents as nodes and add a supervisor node (LLM) that
decides which agent nodes should be called next.

We use Command to route execution to the appropriate agent node based on the supervisor's decision.
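
A hedged sketch of this pattern with two worker agents; the prompts, model, and routing rule are assumptions, and a production supervisor would parse the decision more robustly.

```python
# A supervisor node picks the next agent and routes there with Command.
from typing import Annotated, Literal, TypedDict
from langchain_groq import ChatGroq
from langgraph.graph import StateGraph, START, END
from langgraph.graph.message import add_messages
from langgraph.types import Command

class State(TypedDict):
    messages: Annotated[list, add_messages]

llm = ChatGroq(model="llama-3.1-8b-instant")
WORKERS = ["researcher", "coder"]

def supervisor(state: State) -> Command[Literal["researcher", "coder", "__end__"]]:
    decision = llm.invoke(
        [("system", f"You are a supervisor. Reply with one of {WORKERS}, "
                    "or FINISH if the task is done.")] + state["messages"]
    ).content.strip().lower()
    # Anything that is not a known worker name ends the run.
    return Command(goto=decision if decision in WORKERS else END)

def researcher(state: State) -> Command[Literal["supervisor"]]:
    reply = llm.invoke([("system", "You are a researcher.")] + state["messages"])
    return Command(update={"messages": [reply]}, goto="supervisor")

def coder(state: State) -> Command[Literal["supervisor"]]:
    reply = llm.invoke([("system", "You are a coder.")] + state["messages"])
    return Command(update={"messages": [reply]}, goto="supervisor")

builder = StateGraph(State)
builder.add_node("supervisor", supervisor)
builder.add_node("researcher", researcher)
builder.add_node("coder", coder)
builder.add_edge(START, "supervisor")
graph = builder.compile()
```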
Supervisor Multi-agent Architecture

Diagram: the User Prompt goes to the Supervisor, which routes work to the Enhancer, Researcher, Coder, or Validator agent.
Streaming in LangGraph

If we're building a responsive app for the users, real-time updates are key to keeping them
engaged.

Common use-cases are:

1. Workflow progress (e.g., get state updates after each graph node is executed).

2. LLM tokens as they’re generated.

3. Custom updates (e.g., "Fetched 10/100 records")


Streaming in LangGraph

.stream and .astream are sync and async methods for streaming back outputs from a graph
run.

There are several different modes you can specify when calling these methods (e.g.
`graph.stream(input, stream_mode="values")`):

Most common modes are:

1. stream_mode = "values"

This streams the full value of the state after each step of the graph.

2. stream_mode = "updates"

This streams the updates to the state after each step of the graph
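
A hedged sketch contrasting the two modes, reusing a compiled `graph` such as the chatbot built earlier.

```python
# Comparing stream_mode="values" and stream_mode="updates".
inputs = {"messages": [("user", "Hello!")]}

# "values": each chunk is the FULL state after a step.
for state in graph.stream(inputs, stream_mode="values"):
    print(state["messages"][-1].content)

# "updates": each chunk only contains what that step's node changed,
# keyed by the node name, e.g. {"chatbot": {"messages": [...]}}.
for update in graph.stream(inputs, stream_mode="updates"):
    print(update)
```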
Streaming in LangGraph

stream_mode = "updates" vs stream_mode = "values"


Streaming in LangGraph

In prod apps, we usually want to stream more than the state.

In particular, with LLM calls it is common to stream the tokens as they are generated.

We can do this using the `.astream_events` method, which streams back events as they
happen inside nodes

Each event is a dict with a few keys:

* `event`: This is the type of event that is being emitted.

* `name`: This is the name of the event.

* `data`: This is the data associated with the event.

* `metadata`: Which contains `langgraph_node`, the node emitting the event.
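
A hedged sketch of token streaming with astream_events, again reusing a compiled `graph`; the event filtering shown is the common pattern for chat-model token chunks.

```python
# Streaming LLM tokens as they are generated inside graph nodes.
import asyncio

async def stream_tokens():
    inputs = {"messages": [("user", "Write a haiku about graphs")]}
    async for event in graph.astream_events(inputs, version="v2"):
        # Token chunks are emitted as "on_chat_model_stream" events.
        if event["event"] == "on_chat_model_stream":
            chunk = event["data"]["chunk"]
            print(chunk.content, end="", flush=True)

asyncio.run(stream_tokens())
```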


Deployment

Deployment:

So far, we have built an API around the agent

The last step is to deploy our graph using


industry-standards.

The standard approach to this is using Docker


containers
Conclusion

Self-Hosting LangGraph Agents:

TL;DR: LangGraph Cloud is optional

1. Build API wrapper around your graph

2. Containerize with Docker

3. Deploy to your preferred cloud provider
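
A hedged sketch of step 1, wrapping a compiled `graph` (assumed to be compiled with a checkpointer) in a small FastAPI app; the endpoint name and request shape are assumptions, not the course's code.

```python
# A minimal API wrapper around the compiled graph, ready to be containerized.
from fastapi import FastAPI
from pydantic import BaseModel

app = FastAPI()

class ChatRequest(BaseModel):
    thread_id: str
    message: str

@app.post("/chat")
def chat(req: ChatRequest):
    config = {"configurable": {"thread_id": req.thread_id}}
    result = graph.invoke({"messages": [("user", req.message)]}, config)
    return {"reply": result["messages"][-1].content}

# Run locally with: uvicorn main:app --reload
# Then containerize this app with Docker and deploy it to your cloud provider.
```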
