Dileep Kumar
Khammam, Telangana | dileepreddy66@gmail.com | +91 6305271576
PROFESSIONAL SUMMARY
Experienced Data Engineer with expertise in SQL, Python, Hadoop, and Apache Spark.
Proven ability to optimize data pipelines, monitor large-scale production jobs, and deliver
actionable insights through data analysis. Skilled in building end-to-end data solutions to
improve efficiency and drive business decisions.
WORK EXPERIENCE
Data Engineer and Business Analyst | EXL Service | Noida, UP | Mar 2022 – Aug 2024
Developed programs to deliver standard or customized reports to clients, enhancing
decision-making processes.
Executed Hive queries to pull required data from the data warehouse, ingest input files
into Hive, and perform analytical operations.
Automated and productionized over 400 jobs per month, achieving 99.9% uptime and
improving system reliability.
Optimized data processing workflows, reducing execution time by 30% and lowering
resource costs.
Created and managed Hive tables, data loading pipelines, and complex queries for
large datasets.
Quickly identified and resolved project issues, minimizing impact on deadlines and
overall project progress.
PROJECTS
IPL Data Analysis
Built a data pipeline to read an IPL dataset from an S3 bucket, applying Python and
Apache Spark for data transformations and pre-processing.
Analyzed match data to generate insights on player performance and match outcomes,
driving better team strategies.
Visualized results using Tableau, providing stakeholders with actionable insights
through interactive dashboards.
Sample YouTube Data Analysis
Analyzed sample YouTube data to extract insights on user engagement, content
trends, and growth metrics.
Designed and executed SQL queries to generate detailed reports for scenario-based
analysis.
EDUCATION
Bachelor of Technology (B.Tech.) | National Institute of Technology, Nagpur, India
SKILLS
Programming Languages: Python, SQL, Shell Scripting
Big Data Tools: Hadoop, Hive, Apache Spark, ETL Processes
Data Analysis: Data Cleaning, Pandas, Tableau, Data Visualization
Databases: MySQL Server
Other Tools: Excel, Jira, Linux, Bash
Core Skills: Data Pipeline Development, Production Job Monitoring, Performance
Optimization, Data Warehousing
ADDITIONAL INFORMATION
Open to relocation and adept at working in fast-paced, collaborative environments.
Passionate about deriving insights from data to solve complex business challenges.