Implementation of an LLM-based Chatbot with RAG Structure
Somyajit Chakraborty
University College Cork
Research Assistant
October 22, 2024

Abstract
This paper presents the structure and implementation of a Large Language
Model (LLM)-based chatbot that utilizes Retrieval-Augmented Generation (RAG)
agents for geospatial tasks and emotional interaction. The chatbot is
designed to guide users along scenic or optimal walking paths while
providing personalized and emotionally supportive responses based on
real-time input. We also explain how to integrate this LLM system with an
Android application. The design is inspired by recent advancements in
geospatial LLMs, including the GeoGPT framework [1].

1 Introduction
The integration of Large Language Models (LLMs) into mobile applications
for real-time geospatial tasks is a rapidly advancing field. Retrieval-Augmented
Generation (RAG) models, which retrieve relevant external data before generating
a response, have gained popularity for improving how LLMs handle user queries
[2]. This paper outlines the structure of an LLM-based chatbot that uses RAG
agents to provide route guidance, retrieve points of interest (POIs), and offer
emotionally supportive interaction with users.
We aim to demonstrate how this system can be implemented in an Android
application by leveraging Docker to host the LLM backend. The design closely
follows the GeoGPT structure [1] and builds upon recent developments in LLMs
[4].

2 System Structure
The system comprises three main components:
• RAG Agents: Responsible for retrieving relevant information from map
databases and POIs.
• LLM-based Emotional Interaction: Handles real-time emotionally
supportive responses to user inputs.
• Android Application: The front-end for users to interact with the
chatbot, request routes, and receive POI or emotional responses.
The system’s core workflow involves classifying the user’s input (e.g., route
requests, POI queries, or emotional statements) and using RAG agents to
retrieve relevant data efficiently. The Android application connects to the
backend server, where the LLM processes the data and generates responses.
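
The classification step in this workflow can be illustrated with a minimal sketch. It assumes a simple keyword match; the keyword lists, the `Intent` enum, and the function names are hypothetical and are not taken from the system described here, which may well use the LLM itself or a trained classifier for this step.

```python
from enum import Enum


class Intent(Enum):
    ROUTE = "route"
    POI = "poi"
    EMOTIONAL = "emotional"


# Hypothetical keyword lists, chosen only for illustration.
ROUTE_KEYWORDS = ("route", "path", "walk to", "directions", "how do i get")
POI_KEYWORDS = ("nearby", "cafe", "museum", "park", "restaurant")


def classify_input(text: str) -> Intent:
    """Assign a user message to one of the three request categories."""
    lowered = text.lower()
    if any(keyword in lowered for keyword in ROUTE_KEYWORDS):
        return Intent.ROUTE
    if any(keyword in lowered for keyword in POI_KEYWORDS):
        return Intent.POI
    # Anything that is neither a route nor a POI request falls through
    # to the emotional category.
    return Intent.EMOTIONAL


def dispatch(text: str) -> str:
    """Return the name of the agent that would handle the message."""
    agent_names = {
        Intent.ROUTE: "Routing Agent",
        Intent.POI: "POI Agent",
        Intent.EMOTIONAL: "Emotional Agent",
    }
    return agent_names[classify_input(text)]
```

Under these assumptions, `dispatch("Is there a cafe nearby?")` selects the POI Agent, while a message with no route or POI keyword goes to the Emotional Agent.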

[Figure: System Structure of the LLM-based Chatbot. User Input -> Input Classification -> Routing Agent / POI Agent / Emotional Agent -> Response]

3 RAG Structure and Agents

The RAG structure enhances the efficiency of LLMs by retrieving the most
relevant data from external sources before generating responses. In this chatbot,
three RAG agents are employed:
• Routing Agent: Uses user preferences (e.g., shortest or scenic paths) to
retrieve optimal routes from a geospatial database.
• POI Agent: Fetches information about points of interest located along
the user’s route.
• Emotional Agent: Classifies emotional inputs and generates empathetic
responses using the LLM.

Because each agent retrieves only the data relevant to its own task, the
chatbot searches a smaller space and can return relevant responses more
efficiently.
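
The retrieve-then-generate pattern can be sketched as follows. This is an illustration under stated assumptions, not the system's actual retriever: the `POI` record, the sample data, and the word-overlap scoring are all hypothetical, and a real deployment would more likely query a geospatial database or a vector index. The final prompt string is what would be passed to the LLM.

```python
from dataclasses import dataclass


@dataclass
class POI:
    name: str
    description: str


# Hypothetical sample records standing in for a POI database.
SAMPLE_POIS = [
    POI("Riverside Park", "quiet park with river views"),
    POI("City Museum", "local history museum"),
    POI("Corner Cafe", "coffee and cake"),
]


def score(query: str, poi: POI) -> int:
    """Count query words that also appear in the POI's name or description."""
    query_words = set(query.lower().split())
    poi_words = set(f"{poi.name} {poi.description}".lower().split())
    return len(query_words & poi_words)


def retrieve(query: str, pois: list[POI], top_k: int = 2) -> list[POI]:
    """Return up to top_k POIs with a non-zero overlap score, best first."""
    ranked = sorted(pois, key=lambda poi: score(query, poi), reverse=True)
    return [poi for poi in ranked[:top_k] if score(query, poi) > 0]


def build_prompt(query: str, retrieved: list[POI]) -> str:
    """Place the retrieved records ahead of the question in the LLM prompt."""
    context = "\n".join(f"- {poi.name}: {poi.description}" for poi in retrieved)
    return f"Context:\n{context}\n\nQuestion: {query}\nAnswer:"
```

With the sample data, the query "quiet park near the river" retrieves only Riverside Park, so the prompt given to the LLM contains one relevant record rather than the whole collection.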

[Figure: RAG Structure for Route and Emotional Interaction. User Query -> Search Space -> RAG Agents -> LLM Generation -> Output Response]
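
The Routing Agent's choice between shortest and scenic paths can be sketched as a shortest-path search whose edge cost depends on the user's preference. The graph, the scenic scores, and the 0.8 discount weight below are hypothetical values chosen for illustration; the sketch uses Dijkstra's algorithm, which is one reasonable option and not necessarily what the system uses.

```python
import heapq

# Hypothetical walking graph: node -> list of (neighbour, metres, scenic score 0..1).
GRAPH = {
    "A": [("B", 100, 0.1), ("C", 150, 0.9)],
    "B": [("D", 100, 0.1)],
    "C": [("D", 100, 0.9)],
    "D": [],
}


def edge_cost(metres: float, scenic: float, preference: str) -> float:
    """Use raw distance for 'shortest'; discount scenic edges for 'scenic'."""
    if preference == "scenic":
        # 0.8 is an arbitrary illustrative weight; the cost stays positive.
        return metres * (1.0 - 0.8 * scenic)
    return metres


def find_route(graph, start, goal, preference="shortest"):
    """Dijkstra's algorithm over the preference-adjusted edge costs."""
    queue = [(0.0, start, [start])]
    visited = set()
    while queue:
        cost, node, path = heapq.heappop(queue)
        if node == goal:
            return path
        if node in visited:
            continue
        visited.add(node)
        for neighbour, metres, scenic in graph[node]:
            if neighbour not in visited:
                new_cost = cost + edge_cost(metres, scenic, preference)
                heapq.heappush(queue, (new_cost, neighbour, path + [neighbour]))
    return None  # no route between start and goal
```

On this toy graph the shortest preference yields A, B, D (200 m), while the scenic preference yields the longer but more scenic A, C, D.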

4 Backend Implementation
The backend server for the chatbot is implemented using Python, FastAPI, and
Docker. The LLMs, such as LLaMA or Mistral 7B [3], are hosted within
Docker containers, which simplifies deployment and scaling.

4.1 Backend Components

• LLM: Models available on Hugging Face, such as LLaMA or Mistral 7B, are
used for generating responses to user inputs.
• API Server: FastAPI is used to expose endpoints for retrieving routes,
POIs, and emotional interactions.
• Docker: The entire backend is containerized for easy deployment across
different environments.

The Android application interacts with this backend by sending HTTP
requests, which are processed by the API server. The backend retrieves the
necessary data using the RAG agents and the LLM, and returns a response to the
Android app.
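
The request and response exchange can be sketched in a framework-neutral way. The JSON field names (`message`, `reply`, `error`) and the function name are assumptions made for illustration, and the reply is a placeholder rather than real LLM output. In the FastAPI server described above, logic of this kind would sit inside a route handler, with FastAPI handling the JSON parsing and validation.

```python
import json


def handle_chat_request(body: str) -> tuple[int, str]:
    """Turn a JSON request body into an (HTTP status, JSON response body) pair."""
    try:
        payload = json.loads(body)
    except json.JSONDecodeError:
        return 400, json.dumps({"error": "body must be valid JSON"})

    if not isinstance(payload, dict):
        return 400, json.dumps({"error": "body must be a JSON object"})

    message = payload.get("message")
    if not isinstance(message, str) or not message.strip():
        return 400, json.dumps({"error": "'message' must be a non-empty string"})

    # Placeholder for the real pipeline: input classification, retrieval by
    # the RAG agents, and LLM generation would happen here.
    reply = f"(placeholder reply to: {message.strip()})"
    return 200, json.dumps({"reply": reply})
```

The Android app would then send a body such as `{"message": "Find a scenic route"}` and read the `reply` field from the response, or show an error when the status is 400.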

[Figure: Backend Connectivity with Android Application. Android App -> HTTP Request -> API Server (FastAPI) -> Dockerized LLM -> HTTP Response]

5 Conclusion
This paper presented the design and implementation of an LLM-based chatbot
integrated with RAG agents for geospatial and emotional interaction tasks. By
utilizing Docker and FastAPI, the backend can be easily deployed and scaled
across various platforms. The chatbot provides users with personalized walking
routes, POI information, and emotionally supportive responses, and it is
designed for real-time interaction. Future work will focus on optimizing model
performance for mobile environments and integrating additional features such
as voice interaction and weather-based route suggestions.

References
[1] Zhang, Y., et al. (2023). GeoGPT: Understanding and Processing Geospatial
Tasks through An Autonomous GPT. arXiv preprint arXiv:2307.07930.
[2] Lewis, P., et al. (2020). Retrieval-Augmented Generation for Knowledge-
Intensive NLP Tasks. Advances in Neural Information Processing Systems.
[3] Hugging Face. (2023). LLaMA and Mistral Models. Retrieved from
https://huggingface.co/models.
[4] Touvron, H., et al. (2023). LLaMA: Open and Efficient Foundation Language
Models. arXiv preprint arXiv:2302.13971.
