Formation 90 min read Data Manipulation with Pandas

πŸ“¦ Filtering and Selection with Pandas

Python & Data Science Chapter : Data Manipulation with Pandas Sub-chapter : Filtering and selection

Learning objectives

🎯 Objectives:\n
1Select columns\n2. Filter rows with conditions\n3. Use loc and iloc\n4. Sort data

Introduction

πŸ“– Data selection and filtering are essential in data analysis.

Theoretical content

Selection and filtering:\n
PYTHON
\ndf["column"]\ndf[df["age"] > 25]\n

Practical examples

πŸ’» Example: Sales analysis\n
PYTHON
\nsales[sales["sales"] > 200]\n

Best practices

1Use loc for labels\nβœ… 2. Use iloc for positions\nβœ… 3. Use & for AND, | for OR

Common pitfalls

Modifying a view instead of a copy\n
Use .copy() to create a copy

Summary

df["col"]: select column\nβœ… df[condition]: filtering\nβœ… df.loc[]: label selection\nβœ… df.iloc[]: position selection\nβœ… df.sort_values(): sorting

Additional resources

πŸ“š pandas.pydata.org/docs/user_guide/indexing.html