Question Data

The document contains a series of technical questions across various domains including SQL, Python for Data Analysis, Statistics & Probability, Data Visualization & BI Tools, and Case Study & Business Problem Solving. Each section presents specific tasks or concepts that require understanding and application of data analysis techniques. The questions aim to assess knowledge and skills relevant to data professionals.

Uploaded by

Tendry Sarika

🔹 SQL Questions

1. Write an SQL query to find the second-highest salary from an employee table.
2. How do you remove duplicates from a table without using `DISTINCT`?
3. Write an SQL query to calculate the running total of sales per month.
4. Explain the difference between `INNER JOIN`, `LEFT JOIN`, `RIGHT JOIN`, and
`FULL JOIN`.
5. Given a transactions table, write a query to find the top 3 customers with the
highest spending.
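
One possible way to approach questions 1 and 5 is sketched below using Python's built-in `sqlite3` module with an in-memory database. The table names (`employees`, `transactions`), column names, and sample rows are all assumptions made for illustration; the questions do not specify a schema, and other SQL dialects may prefer different idioms (for example `DENSE_RANK()` for the second-highest salary).

```python
import sqlite3

# Hypothetical schema and sample data; nothing here comes from the question sheet.
conn = sqlite3.connect(":memory:")
conn.executescript("""
    CREATE TABLE employees (name TEXT, salary INTEGER);
    INSERT INTO employees VALUES
        ('Ana', 5000), ('Ben', 7000), ('Cara', 7000), ('Dev', 6000);

    CREATE TABLE transactions (customer_id TEXT, amount REAL);
    INSERT INTO transactions VALUES
        ('c1', 120.0), ('c2', 80.0), ('c1', 30.0),
        ('c3', 200.0), ('c4', 10.0), ('c2', 90.0);
""")

# Question 1: second-highest distinct salary.
# The subquery finds the top salary; the outer MAX picks the largest value below it,
# so ties at the top (two people earning 7000) do not affect the result.
second_highest = conn.execute("""
    SELECT MAX(salary)
    FROM employees
    WHERE salary < (SELECT MAX(salary) FROM employees)
""").fetchone()[0]

# Question 5: top 3 customers by total spending.
top_customers = conn.execute("""
    SELECT customer_id, SUM(amount) AS total_spent
    FROM transactions
    GROUP BY customer_id
    ORDER BY total_spent DESC
    LIMIT 3
""").fetchall()

print(second_highest)
print(top_customers)
```

Note that the subquery form returns `NULL` (Python `None`) when the table holds fewer than two distinct salaries, which is usually the behavior an interviewer expects you to mention.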

🔹 Python for Data Analysis


6. How would you read a large CSV file in Python efficiently?
7. Explain the difference between `apply()`, `map()`, and vectorized operations in
Pandas.
8. Given a dataset, how would you detect and handle missing values?
9. How do you merge two Pandas DataFrames on multiple columns?
10. Write a Python script to find outliers in a dataset using the IQR method.
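
For question 10, a minimal sketch of the IQR method is shown below. It uses only the standard library so that it runs anywhere; in a Pandas workflow, `Series.quantile` would be the more usual way to get the quartiles. The function name `iqr_outliers`, the fence multiplier `k=1.5` (the conventional default), and the sample data are assumptions for illustration.

```python
from statistics import quantiles


def iqr_outliers(values, k=1.5):
    """Return the values lying outside [Q1 - k*IQR, Q3 + k*IQR]."""
    # method="inclusive" interpolates linearly between data points,
    # treating the sample minimum and maximum as the 0th and 100th percentiles.
    q1, _median, q3 = quantiles(values, n=4, method="inclusive")
    iqr = q3 - q1
    lower, upper = q1 - k * iqr, q3 + k * iqr
    return [v for v in values if v < lower or v > upper]


# Hypothetical sample: nine ordinary readings and one extreme value.
sample = [10, 12, 11, 13, 12, 11, 14, 13, 12, 100]
outliers = iqr_outliers(sample)
print(outliers)  # → [100]
```

Here Q1 is 11.25 and Q3 is 13, so the fences sit at 8.625 and 15.625, and only the value 100 falls outside them.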

🔹 Statistics & Probability


11. What is the Central Limit Theorem, and why is it important?
12. How do you check if a dataset is normally distributed?
13. What is the difference between correlation and covariance?
14. Explain Type I and Type II errors in hypothesis testing.
15. How would you determine if a coin is biased based on 100 flips?
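
One common way to approach question 15 is an exact two-sided binomial test against the null hypothesis that the coin is fair. The sketch below is one possible implementation using only the standard library; the function name `binomial_two_sided_p` and the example counts of heads are assumptions, since the question gives no observed data.

```python
from math import comb


def binomial_two_sided_p(heads, flips, p=0.5):
    """Exact two-sided p-value: total probability of all outcomes that are
    no more likely than the observed one under the null hypothesis."""
    def pmf(i):
        return comb(flips, i) * p**i * (1 - p) ** (flips - i)

    observed = pmf(heads)
    # Small relative tolerance so that outcomes tied with the observed one
    # are not dropped because of floating-point rounding.
    total = sum(pmf(i) for i in range(flips + 1) if pmf(i) <= observed * (1 + 1e-9))
    return min(1.0, total)


# Hypothetical observations out of 100 flips.
p_60 = binomial_two_sided_p(60, 100)
p_70 = binomial_two_sided_p(70, 100)
print(f"60 heads: p = {p_60:.4f}")
print(f"70 heads: p = {p_70:.6f}")
```

At the conventional 5% significance level, 60 heads out of 100 gives a p-value just above 0.05, so it would not be enough to reject fairness, whereas 70 heads would be. This ties directly to question 14: the chosen significance level is the Type I error rate you are willing to accept.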

🔹 Data Visualization & BI Tools


16. How would you visualize time-series data in Power BI/Tableau?
17. What are the different types of joins available in Power BI?
18. Explain the difference between a heatmap and a scatter plot.
19. How would you create a dashboard to track key business KPIs?
20. What are calculated fields in Tableau/Power BI?

🔹 Case Study & Business Problem Solving


21. If a company’s sales dropped by 20%, how would you analyze the cause?
22. Given transaction data, how would you identify fraudulent activities?
23. How would you segment customers based on their purchasing behavior?
24. If you see a sudden spike in website traffic, how would you investigate?
25. How would you forecast future sales for a retail company?
