A Synopsis on
CodeSense: AI Software Code Analyzer
Submitted in partial fulfillment of the
requirements for the award of the degree
of
Bachelor of Technology
in
Computer Science and Engineering
by
Yash Sharma (2100970100133)
Sarthak Agrawal (2200970100148)
Shivam Jaiswal (2200970100159)
Tanishq Kumar (2200970100175)
Semester – VII
Under the Supervision of
Ms. Ramandeep Kaur
Galgotias College of Engineering & Technology
Greater Noida 201306
Affiliated to
Dr. APJ Abdul Kalam Technical University, Lucknow
ABSTRACT
The rapid growth of software development has resulted in increasingly complex codebases,
making code review and quality assurance more challenging. Manual code reviews are
time-consuming, error-prone, and often fail to detect hidden issues such as subtle bugs,
code smells, and maintainability problems. To address this gap, we propose CodeSense, an
AI-powered software code analyser that automates code quality inspection and provides
intelligent feedback.
CodeSense integrates Natural Language Processing (NLP), Machine Learning (ML), and static
code analysis to detect errors, security vulnerabilities, and violations of coding standards.
Trained on large-scale open-source repositories, the system learns best practices and
incorporates semantic analysis for deeper insights into program logic. Unlike traditional
analysers, CodeSense not only identifies issues but also suggests corrective measures,
thereby improving code readability, maintainability, and security.
By automating routine checks, the system reduces developer workload, increases
productivity, and enhances software reliability. With applications in both academia and
industry, CodeSense promotes clean coding practices, accelerates development cycles, and
lowers the cost of debugging and maintenance.
INTRODUCTION
Software systems are growing rapidly in scale and complexity, making it increasingly difficult
for developers to maintain clean, secure, and efficient code. Large projects often involve
millions of lines of code, multiple teams, and diverse technologies, which makes manual
code review both time-consuming and error-prone. Even with experienced reviewers, subtle
bugs, performance issues, and security vulnerabilities often remain undetected.
Traditional static analysis tools such as SonarQube, PMD, and FindBugs provide rule-based
inspections that can detect common issues, but they struggle with deeper problems like
code smells, logical flaws, and cross-module dependencies. Dynamic analysers, though more
powerful, require significant computational resources and time, making them unsuitable for
frequent use in agile and fast-paced development environments.
At the same time, the rising number of security threats in modern software—ranging from
SQL injection to cross-site scripting—demands more intelligent and adaptive approaches.
With widespread reliance on third-party libraries and open-source frameworks,
vulnerabilities can spread quickly if not identified early.
Artificial Intelligence (AI) and Machine Learning (ML) present a strong opportunity to
overcome these limitations. Unlike rule-based systems, AI can learn from large repositories
of real-world code, recognize patterns of good and bad practices, and adapt to new
programming paradigms. When combined with Natural Language Processing (NLP), such
systems can even interpret code comments and developer intent, offering richer insights
into maintainability and design quality.
The proposed system, CodeSense, addresses these challenges by combining the
deterministic power of static analysis with the adaptive intelligence of AI. Instead of merely
reporting violations of fixed rules, CodeSense learns from real-world repositories, prioritizes
issues by severity, and suggests corrective actions. This makes it more developer-friendly,
context-aware, and scalable compared to existing tools.
In summary, CodeSense aims to bridge the gap between traditional code analysers and
modern AI-driven solutions. It represents a hybrid approach that not only improves software
reliability and maintainability but also supports developers in writing cleaner, more secure,
and future-ready code.
LITERATURE SURVEY
Ensuring the quality and reliability of software has always been a primary concern in
software engineering. As software projects grow in size and complexity, manual inspection
becomes impractical, leading to the development of automated approaches for code
analysis. The literature reveals extensive research across four main areas: (i) traditional
static analysis tools, (ii) code smell detection and maintainability, (iii) AI-based methods, and
(iv) hybrid systems. Each area has contributed valuable insights, but each also presents
limitations that motivate the need for a more advanced solution such as CodeSense.
1. Traditional Static Analysis Tools
Static analysis represents one of the earliest and most widely adopted methods for code
inspection. Early tools like Lint (Johnson, 1979) focused on identifying stylistic issues and
simple programming errors in C programs. Later, tools such as FindBugs (Hovemeyer &
Pugh, 2004), Checkstyle, and PMD extended this approach to Java and other languages.
Despite their popularity, static analysers have several limitations:
 Heavy reliance on fixed rule sets, which require constant updates for new frameworks
  or languages.
 Lack of semantic understanding, as tools typically check syntax rather than program
  intent.
Figure 1: Workflow of a typical static analyser (Source: Parasoft)
Static analysis is effective at catching simple defects early in the development cycle, but its
limited adaptability calls for more advanced approaches.
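To make the rule-based paradigm concrete, the following minimal sketch implements a single
fixed-rule check on top of Python's standard ast module. The rule choice (flagging bare
except clauses) and the identifier W001 are illustrative assumptions, not the behaviour of
any specific tool.

    import ast

    # Illustrative fixed rule (hypothetical ID W001): flag bare "except:"
    # clauses, a Lint-style check encoded directly as a hand-written rule.
    RULE_ID = "W001"

    def check_bare_except(source: str, filename: str = "<input>"):
        """Walk the syntax tree and report every bare except handler."""
        findings = []
        for node in ast.walk(ast.parse(source, filename=filename)):
            if isinstance(node, ast.ExceptHandler) and node.type is None:
                findings.append((filename, node.lineno, RULE_ID,
                                 "bare 'except:' hides unexpected errors"))
        return findings

    sample = "try:\n    risky()\nexcept:\n    pass\n"
    for fname, line, rule, msg in check_bare_except(sample):
        print(f"{fname}:{line}: {rule} {msg}")

Because every rule must be hand-written in this way, coverage grows only as fast as the
rule catalogue grows, which is precisely the maintenance burden noted above.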
2. Code Smell Detection and Maintainability
The concept of code smells, introduced by Fowler (1999), describes symptoms of poor
design that hinder long-term maintainability. Examples include long methods, duplicated
code, and large classes. Tools like SonarQube and PMD include modules to detect such
smells.
Researchers have contributed significantly in this area:
 Marinescu (2004) proposed metric-based detection strategies using coupling, cohesion,
  and complexity indicators.
 Olbrich et al. (2010) found that classes with smells tend to accumulate more defects
  during software evolution.
 Fontana et al. (2016) validated that eliminating smells early improves maintainability
  and reduces technical debt.
Figure 2: Common code smells that impede maintainability (Source: 8th Light)
While smell detection has improved awareness of design flaws, rule-based methods struggle
with context sensitivity. For example, a “large class” may be acceptable in certain
framework libraries but harmful in business logic. This reinforces the need for
context-aware analysis.
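As a concrete illustration of the metric-based strategies above, the sketch below counts
simple size metrics over a Python syntax tree. The thresholds are assumptions chosen for
demonstration; Marinescu's actual strategies combine several metrics with calibrated
cut-offs.

    import ast

    # Illustrative thresholds (assumed, not Marinescu's calibrated values).
    MAX_METHOD_STATEMENTS = 30   # beyond this, report a "long method"
    MAX_CLASS_METHODS = 15       # beyond this, report a "large class"

    def detect_smells(source: str):
        """Report long methods and large classes by simple size metrics."""
        findings = []
        for node in ast.walk(ast.parse(source)):
            if isinstance(node, ast.FunctionDef):
                # Count the statements nested inside this function.
                n_stmts = sum(isinstance(n, ast.stmt) for n in ast.walk(node))
                if n_stmts > MAX_METHOD_STATEMENTS:
                    findings.append((node.lineno,
                        f"long method '{node.name}' ({n_stmts} statements)"))
            elif isinstance(node, ast.ClassDef):
                n_methods = sum(isinstance(n, ast.FunctionDef) for n in node.body)
                if n_methods > MAX_CLASS_METHODS:
                    findings.append((node.lineno,
                        f"large class '{node.name}' ({n_methods} methods)"))
        return findings

A context-aware analyser would additionally condition such thresholds on where the class
lives, for instance relaxing the large-class rule for generated framework code; that gap is
exactly what CodeSense targets.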
3. AI and Machine Learning Approaches
With advances in AI, researchers have applied machine learning to improve software
analysis. Unlike static rule-based methods, AI can learn patterns from large code
repositories, making predictions about bugs, smells, or vulnerabilities.
3.1 Bug Prediction Models
 Nagappan et al. (2006) used logistic regression on code churn metrics to predict
  defect-prone files (a simplified sketch of this idea follows this list).
 Kim et al. (2011) leveraged change history mining for more accurate bug prediction.
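The following sketch illustrates churn-based defect prediction with scikit-learn on
synthetic data. The feature set, weights, and labels are fabricated for demonstration and
are not Nagappan et al.'s actual dataset or model.

    import numpy as np
    from sklearn.linear_model import LogisticRegression
    from sklearn.model_selection import train_test_split

    # Synthetic churn-style features per file: lines added, lines deleted,
    # number of commits. Real studies mine these from version-control
    # history rather than generating them.
    rng = np.random.default_rng(42)
    X = rng.poisson(lam=(40.0, 25.0, 6.0), size=(500, 3)).astype(float)

    # Assumed ground truth: heavily churned files are more defect-prone.
    y = (X @ np.array([0.02, 0.03, 0.20]) + rng.normal(0, 1, 500) > 3.5).astype(int)

    X_train, X_test, y_train, y_test = train_test_split(X, y, random_state=0)
    model = LogisticRegression().fit(X_train, y_train)
    print("held-out accuracy:", model.score(X_test, y_test))

The appeal of this family of models is that the learned coefficients remain inspectable,
unlike the deep representations discussed next.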
3.2 Deep Learning Representations
 Alon et al. (2019) introduced code2vec, a method that represents source code as
  vectors for tasks like bug detection and method-name prediction.
 Feng et al. (2020) developed CodeBERT, trained on massive GitHub repositories,
  enabling semantic understanding for vulnerability detection (see the embedding sketch
  below).
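As a hedged illustration, the sketch below embeds a code snippet with the public
microsoft/codebert-base checkpoint from the Hugging Face transformers library.
Mean-pooling the token embeddings into a single vector is our simplifying assumption for
demonstration, not the paper's prescribed downstream usage.

    import torch
    from transformers import AutoTokenizer, AutoModel

    # Load the public CodeBERT checkpoint (downloads on first use).
    tokenizer = AutoTokenizer.from_pretrained("microsoft/codebert-base")
    model = AutoModel.from_pretrained("microsoft/codebert-base")

    snippet = "def add(a, b):\n    return a + b"
    inputs = tokenizer(snippet, return_tensors="pt", truncation=True)
    with torch.no_grad():
        outputs = model(**inputs)

    # Mean-pool token embeddings into one vector per snippet; such
    # vectors can feed a downstream bug or vulnerability classifier.
    embedding = outputs.last_hidden_state.mean(dim=1)
    print(embedding.shape)  # torch.Size([1, 768])

The 768-dimensional output reflects the base model's hidden size; snippets that behave
similarly tend to land near each other in this vector space, which is what makes semantic
analysis possible.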
3.3 AI in Security
 Li et al. (2018) used recurrent neural networks (RNNs) to detect SQL injection
  vulnerabilities (a toy classifier in this spirit is sketched at the end of this
  subsection).
 Russell et al. (2019) demonstrated AI-driven detection of cross-site scripting attacks.
Figure 3: Evolution of AI from rule-based logic to deep learning models (Source:
GeeksforGeeks)
AI models achieve higher accuracy and adaptability, but their “black box” nature reduces
developer trust, as they often lack explainability.
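To make the security use case concrete, here is a toy, untrained RNN classifier over
character-encoded query strings, written in PyTorch. The architecture, layer sizes, and
encoding are illustrative assumptions and do not reproduce Li et al.'s model; it also shows
why such detectors are opaque, since the learned weights offer no human-readable rule.

    import torch
    import torch.nn as nn

    class InjectionDetector(nn.Module):
        """Toy LSTM mapping a character sequence to P(SQL injection)."""
        def __init__(self, vocab_size=128, embed_dim=32, hidden_dim=64):
            super().__init__()
            self.embed = nn.Embedding(vocab_size, embed_dim)
            self.rnn = nn.LSTM(embed_dim, hidden_dim, batch_first=True)
            self.head = nn.Linear(hidden_dim, 1)

        def forward(self, x):                        # x: (batch, seq_len) ints
            _, (h, _) = self.rnn(self.embed(x))
            return torch.sigmoid(self.head(h[-1]))   # (batch, 1) probability

    def encode(query: str, max_len: int = 64) -> torch.Tensor:
        """ASCII-encode and zero-pad one query string."""
        ids = [min(ord(c), 127) for c in query[:max_len]]
        return torch.tensor([ids + [0] * (max_len - len(ids))])

    model = InjectionDetector()  # untrained, so the output is arbitrary
    print(model(encode("SELECT * FROM users WHERE id='1' OR '1'='1'")))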
4. Hybrid and Context-Aware Systems
Recognizing the weaknesses of both static and AI-only methods, researchers propose hybrid
analysers that combine rule-based analysis with machine learning.
 White et al. (2019) integrated static rules with ML classifiers, reducing false
  positives by learning from developer feedback (see the triage sketch below).
 Tufano et al. (2020) applied neural machine translation techniques to suggest bug
  fixes, showing promising results over traditional refactoring.
 Industrial systems like Google’s Tricorder and Facebook’s Sapienz already implement
  hybrid approaches in large-scale settings, though they remain proprietary.
Hybrid systems combine the explainability of static rules with the adaptability of ML models,
making them the most balanced approach.
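The following self-contained sketch shows one plausible shape for such a hybrid pipeline:
deterministic rules emit candidate findings, and a trained classifier scores each one so
that likely false positives are suppressed. The Finding structure, DummyClassifier, and
threshold are hypothetical stand-ins; a real system would plug in a trained model such as
the logistic regression sketched earlier.

    from dataclasses import dataclass
    from typing import List

    @dataclass
    class Finding:
        rule_id: str
        line: int
        message: str
        features: List[float]   # numeric context features for the model

    class DummyClassifier:
        """Stand-in for a trained model exposing a scikit-learn-style
        predict_proba; here a single hand-set heuristic weight."""
        def predict_proba(self, rows):
            return [[1 - min(r[0] / 10, 1.0), min(r[0] / 10, 1.0)] for r in rows]

    def triage(findings, classifier, threshold=0.5):
        """Keep findings the model rates as probable true positives,
        ordered by confidence so pressing issues surface first."""
        scored = [(f, classifier.predict_proba([f.features])[0][1])
                  for f in findings]
        return sorted((fp for fp in scored if fp[1] >= threshold),
                      key=lambda fp: -fp[1])

    findings = [Finding("W001", 3, "bare except in request handler", [8.0]),
                Finding("W001", 42, "bare except in test helper", [1.0])]
    for f, p in triage(findings, DummyClassifier()):
        print(f"line {f.line}: {f.message} (p={p:.2f})")

The division of labour is the point: the rule stays auditable while the learned component
absorbs context, which is the balance CodeSense adopts.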
From the reviewed literature, the following insights emerge:
 Static analysers are efficient for surface-level errors but limited in adaptability
  and prone to false positives.
 Code smell research highlights maintainability issues but struggles with contextual
  accuracy.
 AI-based methods excel at learning semantic patterns but lack transparency and
  developer trust.
 Hybrid systems emerge as the most effective direction, blending clarity with
  intelligence.
These findings support the motivation for CodeSense, which aims to implement a hybrid
analyser enhanced with AI techniques to provide context-aware, adaptive, and
developer-friendly insights. By blending the strengths of traditional rule-based
approaches with modern AI capabilities, CodeSense is positioned to minimize false
positives, offer actionable recommendations, and continuously adapt to new programming
trends. Moreover, its ability to integrate seamlessly into existing development workflows
makes it practical for real-world adoption in both industry and academia.
PROBLEM FORMULATION
Despite advances in software engineering, code quality assurance still faces two key
limitations:
1. Static tools – Rigid, rule-based, and unable to adapt to evolving frameworks.
2. AI-only tools – Powerful but opaque, often acting as “black boxes” with limited
explainability.
Thus, the research problem is:
“How can we design an AI-driven code analyser that combines the precision of static
analysis with the adaptability of machine learning to ensure higher code quality, reduced
defects, and improved maintainability?”
Significance of the Research
 For Developers: Reduces debugging effort and accelerates development cycles.
 For Organizations: Lowers maintenance costs and improves software reliability.
 For Academia: Provides a framework for applying AI in software engineering education.
 For Security: Identifies vulnerabilities early, reducing risks of cyberattacks.
By addressing these needs, CodeSense bridges the gap between traditional analysers and
modern AI-driven systems, contributing to the advancement of intelligent software quality
assurance.