NumPy Pandas Interview Q&A

The document provides a comprehensive overview of NumPy and Pandas, including key concepts, functions, and differences between them. It covers topics such as array creation, data manipulation, handling missing values, and statistical operations. Additionally, it addresses performance considerations and methods for efficiently managing large datasets.

Uploaded by

bamboocader


NumPy and Pandas Interview Questions and Answers

1. What is NumPy and why is it used in data science?

NumPy (Numerical Python) is a powerful library for numerical computations in Python. It provides support
for arrays, matrices, and a large number of mathematical functions. In data science, NumPy is used for fast
numerical computations and efficient handling of large datasets, and it serves as the foundation for libraries like
Pandas and SciPy.

2. What is the difference between the arange and range functions?

• range() is a built-in Python function that returns a range object.

• np.arange() is NumPy’s version, which returns a NumPy array.
• np.arange() supports float steps (e.g., np.arange(0, 1, 0.1) ), unlike range() .
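
A minimal sketch of the difference, using small illustrative values. The variable names are hypothetical, and np.linspace is included only as a common alternative when the number of points matters more than the step size:

```python
import numpy as np

r = range(0, 5)          # lazy built-in range object, integers only
a = np.arange(0, 5)      # NumPy array holding 0, 1, 2, 3, 4

# Float steps are allowed in np.arange, but the resulting length can be
# sensitive to floating-point rounding for some step values.
steps = np.arange(0, 1, 0.25)    # 0.0, 0.25, 0.5, 0.75 (stop is excluded)

# np.linspace fixes the number of points and includes the endpoint.
points = np.linspace(0, 1, 5)    # 0.0, 0.25, 0.5, 0.75, 1.0

print(type(r).__name__, type(a).__name__)
print(steps, points)
```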

3. How do you create a NumPy array? Provide examples.

• From list: np.array([1, 2, 3])

• Using functions: np.zeros((2,3)) , np.ones((3,3)) , np.eye(3) , np.linspace(0,10,5)
• Random: np.random.rand(2,2)

4. Difference between Python list and NumPy array

• List: Heterogeneous, slower operations, no vectorization.

• NumPy Array: Homogeneous, faster with vectorized operations.

5. Element-wise operations in NumPy

arr1 = np.array([1, 2, 3])

arr2 = np.array([4, 5, 6])
result = arr1 + arr2 # [5 7 9]

Supports + , - , * , / , ** etc.

6. NumPy dimensions and shapes

• Shape: Tuple representing dimensions e.g., (3, 4)

• Manipulate using reshape() , ravel() , flatten() , transpose()

7. Broadcasting in NumPy

Broadcasting allows arithmetic operations between arrays of different shapes by expanding one or more
arrays to a compatible shape.
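
As a small illustration of broadcasting (the arrays and names here are made up for the example), a 1-D row and a column vector are each stretched to match a 2-D array:

```python
import numpy as np

matrix = np.array([[1, 2, 3],
                   [4, 5, 6]])       # shape (2, 3)
row = np.array([10, 20, 30])         # shape (3,)  -> treated as (1, 3)
col = np.array([[100], [200]])       # shape (2, 1)

plus_row = matrix + row   # row is added to every row of matrix
plus_col = matrix + col   # col is added to every column of matrix

print(plus_row)   # [[11 22 33] [14 25 36]]
print(plus_col)   # [[101 102 103] [204 205 206]]
```

Shapes are compared from the trailing dimension backwards; two dimensions are compatible when they are equal or one of them is 1.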

8. Select specific subset

arr = np.array([1, 2, 3, 4])
arr[1:3] # [2 3]

Supports slicing, boolean indexing, and fancy indexing.
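
The three selection styles can be sketched side by side; the array values and variable names below are illustrative only:

```python
import numpy as np

arr = np.array([10, 15, 20, 25, 30])

sliced = arr[1:4]          # slicing: positions 1 to 3 -> [15 20 25]

mask = arr > 18            # boolean array, one flag per element
filtered = arr[mask]       # boolean indexing -> [20 25 30]

picked = arr[[0, 2, 4]]    # fancy indexing with a list of positions -> [10 20 30]

print(sliced, filtered, picked)
```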

9. Aggregation functions

np.sum() , np.mean() , np.min() , np.max() , np.std() , np.var()

np.mean([1, 2, 3]) # 2.0

10. Handling missing values

Use np.nan , np.isnan() , and aggregation functions like np.nanmean() , np.nansum()
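
A short sketch of why the nan-aware functions matter, using a made-up array with one missing value:

```python
import numpy as np

data = np.array([1.0, np.nan, 3.0])

missing = np.isnan(data)        # [False  True False]
plain_mean = np.mean(data)      # nan, because NaN propagates through arithmetic
safe_mean = np.nanmean(data)    # 2.0, NaN entries are ignored
safe_sum = np.nansum(data)      # 4.0

print(missing, plain_mean, safe_mean, safe_sum)
```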

11. Handling large datasets

NumPy uses contiguous memory blocks and vectorized operations, making it efficient in handling large
arrays with minimal memory overhead.

12. Matrix multiplication

np.dot(A, B) or A @ B

13. Linear algebra module

np.linalg : functions like inv() , eig() , svd() , solve() for solving systems, finding inverses,
eigenvalues, etc.
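
For instance, a 2x2 linear system can be solved as follows. The system itself (2x + y = 5, x + 3y = 10) is an arbitrary example chosen for illustration:

```python
import numpy as np

# Coefficient matrix and right-hand side for:
#   2x +  y = 5
#    x + 3y = 10
A = np.array([[2.0, 1.0],
              [1.0, 3.0]])
b = np.array([5.0, 10.0])

solution = np.linalg.solve(A, b)     # approximately [1. 3.]
via_inverse = np.linalg.inv(A) @ b   # same result via the explicit inverse

print(solution)
print(np.allclose(A @ solution, b))  # True: the solution satisfies the system
```

solve() is generally preferred over computing the inverse explicitly, since it avoids forming the inverse matrix.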

14. Random functions

np.random.rand() , np.random.randint() , np.random.normal() , np.random.seed()

15. Vectorization

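Replace explicit Python loops with whole-array expressions:

np.array([1, 2, 3]) ** 2 # [1 4 9]

np.vectorize(f) is a convenience wrapper that still calls f once per element, so it does not provide the speed of true vectorization.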
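
A minimal comparison of an explicit loop with the equivalent array expression, on a small made-up array:

```python
import numpy as np

values = np.array([1, 2, 3, 4])

# Explicit Python loop: one interpreter-level operation per element.
squared_loop = np.array([x ** 2 for x in values])

# Array expression: the loop runs inside NumPy's compiled code.
squared_array = values ** 2

print(squared_loop, squared_array)   # both [ 1  4  9 16]
```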

16. np.where() usage

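Element-wise conditional selection (with only a condition, np.where() returns the matching indices instead):

np.where(arr > 0, 1, 0) # 1 where arr > 0, otherwise 0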

17. Statistical operations

mean() , median() , percentile() , std() , etc.

18. Masked arrays

Useful for ignoring invalid entries:

masked = np.ma.masked_array(data, mask=condition)

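19. copy() vs view()

• arr.copy() (or np.copy(arr)) creates a new array with its own data.

• arr.view() creates a new array object that shares the same data, so changes made through the view affect the original.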
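
The difference shows up as soon as one of the arrays is modified. A small sketch with illustrative names:

```python
import numpy as np

original = np.array([1, 2, 3])

shared = original.view()     # new array object, same underlying data
separate = original.copy()   # new array object, independent data

shared[0] = 99               # writes through to original

print(original)   # [99  2  3]
print(separate)   # [1 2 3]
print(np.shares_memory(original, shared), np.shares_memory(original, separate))
```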

20. Array reshaping

Use reshape() (new shape, same data where possible), ndarray.resize() (changes the array in place), flatten() (returns a 1-D copy)

21. Concatenate vs vstack

• np.concatenate([a, b], axis=0)

• np.vstack([a, b]) : vertical stack
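
The two differ mainly for 1-D inputs, as this sketch with two made-up arrays shows:

```python
import numpy as np

a = np.array([1, 2, 3])
b = np.array([4, 5, 6])

joined = np.concatenate([a, b], axis=0)   # stays 1-D: shape (6,)
stacked = np.vstack([a, b])               # each 1-D input becomes a row: shape (2, 3)

print(joined.shape, stacked.shape)

# For inputs that are already 2-D, vstack matches concatenate along axis 0.
m = np.array([[1, 2], [3, 4]])
same = np.array_equal(np.vstack([m, m]), np.concatenate([m, m], axis=0))
print(same)   # True
```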

22. Polynomial functions

np.poly1d() , np.polyfit() , np.polyval() for polynomial creation and evaluation.
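
As a rough sketch, the three functions can be combined to fit and evaluate a line. The data points below are synthetic, generated from y = 2x + 1:

```python
import numpy as np

x = np.array([0.0, 1.0, 2.0, 3.0])
y = 2 * x + 1                       # synthetic data on an exact line

coeffs = np.polyfit(x, y, 1)        # degree-1 fit: approximately [2. 1.]
predicted = np.polyval(coeffs, 4)   # evaluate the fitted polynomial at x = 4

line = np.poly1d(coeffs)            # callable polynomial object
print(coeffs, predicted, line(4))
```

Coefficients are ordered from the highest power down, so the slope comes first.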

23. Memory layout

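By default, NumPy arrays are stored in a contiguous block in row-major (C) order; column-major (Fortran) order is available via order='F'. Contiguous storage enables faster computations.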

24. Statistical tests

NumPy provides descriptive measures such as np.corrcoef() and np.cov() but no hypothesis tests; those are available in scipy.stats.

25. np.histogram()

Used to compute the frequency distribution:

np.histogram(data, bins=5)

26. Array initialization

np.zeros() , np.ones() , np.full() , np.eye() , np.empty()

27. Complex numbers

arr = np.array([1+2j, 3+4j])

np.real(arr), np.imag(arr)

28. FFT

spectrum = np.fft.fft(signal) # forward transform
np.fft.ifft(spectrum) # inverse transform; recovers signal up to rounding error

29. np.unique()

Returns the sorted unique elements; with return_counts=True it also returns how often each one occurs.

np.unique(arr, return_counts=True)
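
A quick example on a made-up array shows the two returned arrays:

```python
import numpy as np

arr = np.array([3, 1, 2, 3, 1, 3])

values, counts = np.unique(arr, return_counts=True)

print(values)   # [1 2 3]
print(counts)   # [2 1 3]
```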

30. What is Pandas?

Pandas is a data analysis and manipulation library built on NumPy. It provides Series and DataFrame .

31. Create DataFrame

df = pd.DataFrame({'A':[1,2], 'B':[3,4]})

32. Reading data

pd.read_csv(), pd.read_excel(), pd.read_sql(), pd.read_json()

33. Missing data handling

df.isna() , df.fillna() , df.dropna()

34. Aggregation and grouping

df.groupby('column').agg(['sum','mean'])

35. Merge and join

• pd.merge() for joining on keys.

• df.join() for joining on index.
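
A minimal sketch of both approaches on two small made-up tables. The column names key, x, and y are hypothetical:

```python
import pandas as pd

left = pd.DataFrame({'key': ['a', 'b', 'c'], 'x': [1, 2, 3]})
right = pd.DataFrame({'key': ['a', 'b', 'd'], 'y': [10, 20, 40]})

# merge: join on a shared column; an inner join keeps only matching keys.
merged = pd.merge(left, right, on='key', how='inner')

# join: aligns on the index, so the key column is moved into the index first.
joined = left.set_index('key').join(right.set_index('key'), how='left')

print(merged)   # rows for keys 'a' and 'b'
print(joined)   # rows for 'a', 'b', 'c'; y is NaN for 'c'
```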

36. Filtering and sorting

df[df['col'] > 5], df.sort_values('col')

37. Row/column manipulation

df.drop() , df.insert() , df.rename()

38. apply() method

Applies a function along an axis: to each column (axis=0, the default) or to each row (axis=1).

39. Indexing methods

.loc[] , .iloc[] , .at[] , .iat[]

40. Time series handling

pd.to_datetime() , resample() , rolling()

41. Pivot table

pd.pivot_table(df, values='val', index='A', columns='B')

42. Normalization (z-score standardization)

df['col'] = (df['col'] - df['col'].mean()) / df['col'].std()

This rescales the column to zero mean and unit standard deviation.

43. pd.concat()

Used for appending/combining dataframes row-wise or column-wise.

44. rolling()

Window-based calculations:

df.rolling(window=3).mean()

45. Transformation and aggregation

df.groupby('A').transform('mean')
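
To make the contrast with aggregation concrete, here is a small sketch on a made-up frame. The new column name B_group_mean is hypothetical:

```python
import pandas as pd

df = pd.DataFrame({'A': ['x', 'x', 'y'], 'B': [1, 3, 10]})

# Aggregation: one result per group.
per_group = df.groupby('A')['B'].mean()

# Transformation: one result per original row, aligned with df's index.
df['B_group_mean'] = df.groupby('A')['B'].transform('mean')

print(per_group)   # x -> 2.0, y -> 10.0
print(df)
```

Because the transformed result keeps the original shape, it can be assigned straight back as a new column.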

46. Multi-index

df.set_index(['col1', 'col2'])

47. query() method

Filter using string expressions.

df.query('col > 5')

48. Large datasets

Use chunksize in readers, filter early, optimize data types.
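
These ideas can be sketched together as follows. To keep the example self-contained, an in-memory CSV built with io.StringIO stands in for a large file, and the column names id and value are hypothetical:

```python
import io
import pandas as pd

# Small in-memory CSV standing in for a file too large to load at once.
csv_text = "id,value\n" + "\n".join(f"{i},{i * 2}" for i in range(10))

total = 0
for chunk in pd.read_csv(io.StringIO(csv_text), chunksize=4):
    # Optimize data types: downcast to the smallest integer type that fits.
    chunk['value'] = pd.to_numeric(chunk['value'], downcast='integer')
    # Filter early: keep only the rows needed before doing further work.
    wanted = chunk.loc[chunk['id'] >= 5, 'value']
    total += int(wanted.sum())

print(total)   # 70
```

Only one chunk is held in memory at a time, so peak memory depends on chunksize rather than on the file size.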

49. Categorical data

astype('category') reduces memory for columns with few distinct values relative to their length.

50. Merge vs Join

merge() is more versatile, join() is convenient for index-based joins.

51. Slicing and selection

Use .loc[] (label-based), .iloc[] (position-based), or row slicing such as df[1:5].

52. Airbnb-style question: Efficiently handling, transforming, and visualizing data using Pandas for
business decision-making.

53. Complex groupby()

df.groupby(['A','B']).agg({'C':'sum', 'D':'mean'})

54. applymap()

Element-wise function application for DataFrames. In pandas 2.1 and later, applymap() is deprecated in favor of DataFrame.map().

55. pd.to_datetime()

pd.to_datetime(df['date_column'])

56. Advanced missing values

Interpolate, forward/backward fill: df.interpolate(), df.ffill(), df.bfill() (fillna(method='bfill') is deprecated in recent pandas versions)

57. pd.cut() and pd.qcut()

Binning continuous data into discrete intervals.
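
The two functions differ in how bin edges are chosen. A short sketch with made-up ages and hypothetical labels:

```python
import pandas as pd

ages = pd.Series([5, 17, 30, 45, 70])

# pd.cut: bin edges are given explicitly; intervals are right-inclusive by default.
by_range = pd.cut(ages, bins=[0, 18, 65, 100],
                  labels=['child', 'adult', 'senior'])

# pd.qcut: bin edges are derived from quantiles, so bins hold roughly equal counts.
by_quantile = pd.qcut(ages, q=2, labels=['low', 'high'])

print(by_range.tolist())
print(by_quantile.tolist())
```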

58. Hierarchical indexing

Used for multi-level indexes, especially after groupby or pivot.

59. pd.melt()

Unpivots a DataFrame from wide to long format.
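
For example, a small made-up table of scores can be unpivoted like this. The column names id, math, art, subject, and score are hypothetical:

```python
import pandas as pd

wide = pd.DataFrame({'id': [1, 2], 'math': [90, 80], 'art': [70, 60]})

# Keep 'id' as the identifier; every other column becomes a (subject, score) row.
long = pd.melt(wide, id_vars='id', var_name='subject', value_name='score')

print(long)   # 4 rows: two students x two subjects
```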

60. Custom aggregation

df.groupby('A').agg({'B': lambda x: x.max() - x.min()})

61. Performance considerations

Avoid loops, use vectorized ops, downcast data types, filter early.

62. query() for efficient selection

When the optional numexpr package is installed, query() evaluates the expression through it, which can be faster and use less memory on large DataFrames.

End of Document.
