SNS COLLEGE OF TECHNOLOGY
(An Autonomous Institution)
DEEP LEARNING ASSIGNMENT PHASE - II
NEWS CATEGORY CLASSIFIER USING GEN AI
NAME: VASANTH.A
DEPT: III AIML B
REG NO. : 713522AM114
ABSTRACT
This project presents a deep learning-powered News Category Classifier
enhanced with Generative AI capabilities for improved content understanding
and user interaction. Traditional classifiers often label articles without offering
deeper context. To address this, we utilize the distilbert-base-uncased
transformer model fine-tuned on a labeled news dataset, enabling accurate
categorization across domains such as sports, politics, business, and technology.
Integrated with an interactive Streamlit UI, users can input or paste any news
article, and the app processes the text through the model to predict its category.
To further enhance interpretability, the app optionally generates human-like
summaries or reasoning using a lightweight LLM accessed via the Hugging Face
API or Ollama CLI. This hybrid approach not only ensures high classification
accuracy but also bridges the explainability gap, making it ideal for media
houses, content aggregators, and academic research platforms.
INTRODUCTION
In an era where information is generated at an unprecedented pace, organizing
and categorizing news content efficiently has become a critical challenge for
media platforms and content aggregators. While traditional machine learning
models can classify articles into categories like politics, sports, or business, they
often fail to provide insights into why a certain classification was made. This
lack of explainability reduces trust and limits the system’s usability for editorial
teams and end-users.
This project addresses that gap using Generative AI. By fine-tuning a
transformer-based language model (DistilBERT) on labeled news datasets, and
optionally integrating a large language model (e.g., LLaMA 2 or Mistral via
Hugging Face), we enable the system not only to classify articles accurately but
also to justify its decisions in natural language. This approach enhances
transparency, improves user engagement, and makes the classifier suitable for
real-world deployment in journalism, academic research, and content
moderation.
PROCESS FOR IDENTIFICATION OF PROBLEM
STATEMENT
The following steps were followed:
❖ Domain Analysis:
➢ We examined existing news classification systems and observed
that while many models can label articles accurately, they lack
transparency and offer no rationale behind their decisions,
reducing user trust and engagement.
❖ Dataset Acquisition:
➢ We used a labeled dataset containing news headlines and article
content tagged with categories such as politics, sports, technology,
business, and more. This dataset enabled us to train and evaluate
the classifier on diverse real-world examples.
❖ Problem Framing:
➢ The key question became:
“Can we develop a news classifier that not only categorizes articles
correctly but also explains the reasoning behind each classification
using human-like language?”
❖ Model Choice Justification:
➢ DistilBERT, a lightweight and efficient transformer model,
was selected for its strong performance on text classification
tasks. It was further enhanced using a generative model to
generate natural language explanations.
ARCHITECTURE
STAGES OF DEVELOPMENT
Stage 1: Data Preprocessing
• Collected and cleaned news category dataset (e.g., AG
News, BBC News, etc.)
• Transformed each record into natural language prompts
(e.g., “Classify the following news headline: ‘Stocks fall
amid inflation fears.’”)
• Split the dataset into training and testing sets for model
evaluation
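The Stage 1 steps above can be sketched as follows. This is a minimal illustration using only the standard library; the helper names (make_prompt, split_records) and the sample records are ours, not part of the actual dataset.

```python
# Sketch of Stage 1: turning raw (text, label) records into natural
# language classification prompts, then splitting into train/test sets.
import random

def make_prompt(headline: str) -> str:
    """Wrap a raw headline in the classification instruction."""
    return f"Classify the following news headline: '{headline}'"

def split_records(records, test_ratio=0.2, seed=42):
    """Shuffle a copy of the records and split into train/test lists."""
    shuffled = records[:]
    random.Random(seed).shuffle(shuffled)
    cut = int(len(shuffled) * (1 - test_ratio))
    return shuffled[:cut], shuffled[cut:]

records = [
    ("Stocks fall amid inflation fears.", "Business"),
    ("Local team wins the championship.", "Sports"),
    ("Parliament passes new data privacy bill.", "Politics"),
    ("Chipmaker unveils next-gen AI processor.", "Technology"),
]
prompts = [(make_prompt(text), label) for text, label in records]
train, test = split_records(prompts)
```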
Stage 2: Fine-Tuning the LLaMA or Mistral Model
• Utilized Hugging Face’s transformers and datasets
libraries
• Fine-tuned with LoRA (Low-Rank Adaptation) for
efficient training
• Model options:
meta-llama/Llama-2-7b-chat-hf or
mistralai/Mistral-7B-Instruct
Instruction tuning format used:
{
  "prompt": "Classify this news: 'The Prime Minister addressed the nation regarding economic reforms...'",
  "response": "Politics. This is related to government and public administration."
}
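The prompt–response records above can be serialized into a JSONL training file, which is the shape most instruction-tuning scripts expect. A standard-library sketch; the CATEGORY_NOTES mapping and record contents are illustrative.

```python
# Sketch: serializing labeled articles into the instruction-tuning
# format shown above, one JSON object per line (JSONL).
import json

CATEGORY_NOTES = {
    "Politics": "This is related to government and public administration.",
    "Sports": "This is related to games, teams, and athletic events.",
}

def to_instruction_record(text: str, category: str) -> dict:
    """Build one prompt/response pair in the fine-tuning format."""
    return {
        "prompt": f"Classify this news: '{text}'",
        "response": f"{category}. {CATEGORY_NOTES.get(category, '')}".strip(),
    }

records = [
    ("The Prime Minister addressed the nation regarding economic reforms...", "Politics"),
    ("The striker scored twice in the final.", "Sports"),
]
lines = [json.dumps(to_instruction_record(t, c)) for t, c in records]
# with open("train.jsonl", "w") as f:
#     f.write("\n".join(lines))
```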
Stage 3: Frontend UI Development
• Designed using Streamlit
• Users input headlines/articles via st.text_input() or
st.text_area()
• Prompts are dynamically constructed and passed to the
model backend
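A minimal sketch of the Stage 3 interface. The build_prompt() helper and the exact prompt wording are illustrative; the Streamlit calls run when the script is launched via `streamlit run`, and the import is guarded so the helper stays usable without Streamlit installed.

```python
# Sketch of Stage 3: Streamlit front end with a pure prompt-builder.
def build_prompt(article: str) -> str:
    """Construct the classification prompt sent to the model backend."""
    return f"Classify the following news article and explain why:\n\n{article}"

try:
    import streamlit as st
except ImportError:  # lets build_prompt be imported without Streamlit
    st = None

if st is not None:
    st.title("News Category Classifier")
    article = st.text_area("Paste a news headline or article:")
    if st.button("Classify") and article.strip():
        with st.spinner("Classifying..."):
            prompt = build_prompt(article)
            # hand `prompt` to the model backend (see Stage 4)
```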
Stage 4: Model Invocation
• Prompt sent to Ollama CLI using:
ollama run news-classifier-model
• Model generates prediction + explanation in human-readable format
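The invocation above can be wrapped in a small helper that shells out to the Ollama CLI. A sketch, not the project's exact backend code: the model name is taken from the command shown, and classify() only succeeds on a machine with Ollama and that model installed.

```python
# Sketch of Stage 4: sending a prompt to a local Ollama model.
import subprocess

MODEL = "news-classifier-model"

def build_command(prompt: str) -> list:
    """Assemble the `ollama run <model> <prompt>` invocation."""
    return ["ollama", "run", MODEL, prompt]

def classify(prompt: str) -> str:
    """Run the model locally; stdout holds the prediction + explanation."""
    result = subprocess.run(
        build_command(prompt), capture_output=True, text=True, check=True
    )
    return result.stdout.strip()

cmd = build_command("Classify this news: 'Stocks fall amid inflation fears.'")
```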
Stage 5: Integration & Optimization
• Added @st.cache_resource to optimize model loading
• Integrated loading indicators, clean interface layout, and
category-wise color tags for better UX
PLATFORMS AND TOOLS INCORPORATED
1. Python
● Role: Backbone of the entire project
● Why Used: Python is the preferred language for AI/ML due to its
simplicity, extensive libraries, and vibrant ecosystem.
● Usage:
○ Data preprocessing (pandas, numpy)
○ Model training and prompt engineering
○ Streamlit app development
○ Interfacing with Ollama CLI
2. Streamlit
● Role: Frontend user interface
● Why Used: Streamlit is a fast and easy way to build web apps for
machine learning and data science projects.
● Features Used:
○ st.text_input() and st.text_area() for collecting the news
headline or article
○ st.chat_message() to simulate a ChatGPT-like interaction
○ st.spinner() to show loading status during inference
● Outcome: Created a clean, interactive UI where users paste a news
headline or article and receive the predicted category along with a
conversational explanation from the AI model.
3. Hugging Face Transformers
● Role: Model fine-tuning and inference pipeline
● Why Used: Hugging Face provides tools for loading, training, and
deploying large language models like LLaMA.
● Usage:
○ Downloading the base meta-llama/Llama-2-7b-chat-hf
model
○ Converting structured data into instruction-format prompts
○ Fine-tuning the model with domain-specific data
○ Creating inference pipelines to integrate with the UI
4. Ollama CLI
● Role: Lightweight runtime to serve the LLaMA model locally
● Why Used: Traditional deployment of LLaMA models requires heavy
GPU servers. Ollama simplifies this by offering containerized, fast local
inference.
● Usage:
○ Hosting the fine-tuned LLaMA model using a Modelfile
○ Responding to prompts sent from the Streamlit app
Command Example:
ollama run llama-custom-model
5. CUDA / GPU (Optional but Recommended)
● Role: Hardware acceleration for training large models
● Why Used: Fine-tuning a model like LLaMA-2-7B requires high memory
and compute, which CPUs can’t handle efficiently.
● Toolkits: NVIDIA CUDA, cuDNN
● Outcome: Enabled faster training during the model fine-tuning phase
6. Pandas
● Role: Data analysis and preprocessing
● Why Used: Easy handling of structured data in tabular format (CSV
files).
● Usage:
○ Read the labeled news dataset (CSV format)
○ Cleaned text fields and normalized category labels
○ Merged headline and article text into training prompts
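The pandas preprocessing described above can be sketched as follows, using a small inline DataFrame in place of the real CSV; the column names and sample rows are illustrative.

```python
# Sketch: cleaning a news DataFrame and merging fields into prompts.
import pandas as pd

df = pd.DataFrame({
    "headline": ["Stocks fall amid inflation fears.", "Team clinches title.", None],
    "category": ["business", "sports", "politics"],
})

# Drop rows with missing text and normalize category labels
df = df.dropna(subset=["headline"]).copy()
df["category"] = df["category"].str.strip().str.title()

# Merge the cleaned text into a training prompt column
df["prompt"] = "Classify the following news headline: '" + df["headline"] + "'"
```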
7. Scikit-learn
● Role: Data preparation and utility functions
● Why Used:
○ Used for splitting the dataset into training and testing sets
(train_test_split)
○ Label encoding for categorical features (if needed)
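The two scikit-learn utilities above can be shown in a few lines. The sample texts and labels are placeholders, not project data.

```python
# Sketch: splitting data and encoding category labels with scikit-learn.
from sklearn.model_selection import train_test_split
from sklearn.preprocessing import LabelEncoder

texts = ["stocks fall", "team wins", "bill passed",
         "new chip", "match drawn", "budget vote"]
labels = ["business", "sports", "politics",
          "technology", "sports", "politics"]

# LabelEncoder maps category names to integer ids (sorted alphabetically)
encoder = LabelEncoder()
y = encoder.fit_transform(labels)

# train_test_split keeps text/label pairs aligned across the split
X_train, X_test, y_train, y_test = train_test_split(
    texts, y, test_size=0.33, random_state=42
)
```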
8. LoRA (Low-Rank Adaptation)
● Role: Parameter-efficient fine-tuning
● Why Used: Fine-tuning large models like LLaMA from scratch is
memory-intensive. LoRA reduces the number of trainable parameters.
● Library: peft from Hugging Face ecosystem
● Outcome: Reduced training cost and made fine-tuning feasible on a
mid-tier GPU
9. Hugging Face Datasets
● Role: Efficient data handling for model training
● Why Used:
○ Compatible with Hugging Face training pipeline
○ Fast I/O and in-memory caching for better training performance
● Usage:
○ Loaded prompt-response pairs in a format suitable for training the
LLaMA model
10. Modelfile (Ollama Specific)
● Role: Configuration file for serving LLaMA models using Ollama
● Why Used: Specifies base model, adapter files, and prompt formatting
Example:
FROM llama2
ADAPTER my-fine-tuned-model.bin
SYSTEM "You are a helpful news category classifier"
OUTCOME
INPUT
OUTPUT
SOURCE CODE