0% found this document useful (0 votes)

3 views4 pages

Regular Expressions in Python

This document provides an introduction to regular expressions (regex) in Python, detailing the use of the re module for tasks such as searching and manipulating text. It covers the basics of regex syntax, common functions, and practical examples including phone number validation and email matching. Additionally, it emphasizes the efficiency of compiling regex patterns for reuse.

Uploaded by

clate6941

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

3 views4 pages

Regular Expressions in Python

Uploaded by

clate6941

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

You are on page 1/ 4

Regular Expressions in Python

Regular expressions, or regex, are sequences of characters that form search patterns. Python provides
the re module for working with regex, allowing tasks like searching, matching, and manipulating text.
This guide introduces the basics of regex in Python with detailed explanations and numerous coding
examples.

1. Basics of Regular Expressions

A regex defines a pattern to match strings. For example:

• Literal Characters: Match exact characters (e.g., cat matches "cat").

• Metacharacters: Special symbols for patterns (e.g., ., *, +).

2. The re Module

To use regex, first import the re module:

import re

Common Functions in the re Module:

1. re.match(): Matches a pattern at the start of a string.

2. re.search(): Searches for a pattern anywhere in the string.

3. re.findall(): Returns all occurrences of a pattern as a list.

4. re.sub(): Replaces occurrences of a pattern with a specified string.

5. re.compile(): Compiles a regex pattern into a reusable object.

3. Regex Syntax

a) Metacharacters

Metacharacter Description Example

. Matches any character except newline a.c matches "abc", "adc"

^ Matches start of string ^Hello matches "Hello"

$ Matches end of string world$ matches "world"

* Matches 0 or more occurrences ca*t matches "ct", "cat"

Metacharacter Description Example

+ Matches 1 or more occurrences ca+t matches "cat" only

? Matches 0 or 1 occurrence ca?t matches "ct", "cat"

{} Matches specific repetitions a{2,3} matches "aa", "aaa"

b) Special Sequences

Sequence Description Example

\d Matches any digit \d+ matches "123"

\D Matches any non-digit \D+ matches "abc"

\w Matches any word character (alphanumeric) \w+ matches "word123"

\W Matches any non-word character \W+ matches "@#$"

\s Matches whitespace \s+ matches spaces

\S Matches non-whitespace \S+ matches "word"

4. Examples

a) re.match()

import re

# Example 1: Match at the start

text = "hello world"

result = re.match(r'hello', text)

print(result.group() if result else "No match") # Output: hello

# Example 2: Fails to match in the middle

result = re.match(r'world', text)

print(result) # Output: None

b) re.search()

# Example 1: Search anywhere in the string

result = re.search(r'world', text)

print(result.group() if result else "Not found") # Output: world

c) re.findall()

# Example 1: Find all occurrences

text = "abc 123 def 456"

result = re.findall(r'\d+', text)

print(result) # Output: ['123', '456']

d) re.sub()

# Example 1: Replace digits with '#'

text = "abc 123 def 456"

result = re.sub(r'\d', '#', text)

print(result) # Output: abc ### def ###

e) Regex with Special Characters

# Matching email

email = "example@example.com"

pattern = r'[a-zA-Z0-9._%+-]+@[a-zA-Z0-9.-]+\.[a-zA-Z]{2,}'

result = re.match(pattern, email)

print(result.group() if result else "Invalid email") # Output: example@example.com

5. Practical Examples

a) Validate a Phone Number

phone = "123-456-7890"

pattern = r'^\d{3}-\d{3}-\d{4}$'

if re.match(pattern, phone):
print("Valid phone number")

else:

print("Invalid phone number")

b) Extract URLs from Text

text = "Visit https://example.com and http://test.com."

pattern = r'https?://[a-zA-Z0-9.-]+'

urls = re.findall(pattern, text)

print(urls) # Output: ['https://example.com', 'http://test.com']

c) Password Validation

password = "StrongP@ssw0rd"

pattern = r'^(?=.*[A-Z])(?=.*[a-z])(?=.*\d)(?=.*[@$!%*?&])[A-Za-z\d@$!%*?&]{8,}$'

if re.match(pattern, password):

print("Strong password")

else:

print("Weak password")

6. Compiling Patterns for Efficiency

If a pattern is reused, compile it for better performance:

compiled_pattern = re.compile(r'\d+')

text = "Numbers: 123, 456, 789"

matches = compiled_pattern.findall(text)

print(matches) # Output: ['123', '456', '789']

Manipulating Text With Regular Expression in Python
No ratings yet
Manipulating Text With Regular Expression in Python
4 pages
Data Analysis Using Python Lab Ex3
No ratings yet
Data Analysis Using Python Lab Ex3
27 pages
Regular Expressions
No ratings yet
Regular Expressions
9 pages
Regular Expressions in Python
No ratings yet
Regular Expressions in Python
12 pages
Python Course: Session 6b - Regular Expressions
No ratings yet
Python Course: Session 6b - Regular Expressions
11 pages
Howto Regex
No ratings yet
Howto Regex
19 pages
Module 24 Regular Expressions Revisited
No ratings yet
Module 24 Regular Expressions Revisited
15 pages
Assignment 3
No ratings yet
Assignment 3
3 pages
Regular Expression Python
No ratings yet
Regular Expression Python
23 pages
Lec 06 - Regular Expression
No ratings yet
Lec 06 - Regular Expression
19 pages
Unit - 4 Regex
No ratings yet
Unit - 4 Regex
28 pages
Lecture 6 Re Basics
No ratings yet
Lecture 6 Re Basics
12 pages
Howto Regex
No ratings yet
Howto Regex
20 pages
Howto Regex
No ratings yet
Howto Regex
17 pages
Chapter 10
No ratings yet
Chapter 10
28 pages
Howto Regex PDF
No ratings yet
Howto Regex PDF
20 pages
Regular
No ratings yet
Regular
9 pages
9python Simple Character Matches
No ratings yet
9python Simple Character Matches
19 pages
Regex Metacharacters Guide
No ratings yet
Regex Metacharacters Guide
49 pages
Howto Regex
No ratings yet
Howto Regex
20 pages
Python Regex Guide
No ratings yet
Python Regex Guide
20 pages
RegEx in Python
No ratings yet
RegEx in Python
5 pages
Howto Regex
No ratings yet
Howto Regex
20 pages
UNIT4
No ratings yet
UNIT4
67 pages
Regex Lab for Data Scientists
No ratings yet
Regex Lab for Data Scientists
11 pages
Summary Python 1
No ratings yet
Summary Python 1
36 pages
Reg Ex
No ratings yet
Reg Ex
3 pages
Unit7 RegularExpressionpdf 2023 10 17 09 16 29
No ratings yet
Unit7 RegularExpressionpdf 2023 10 17 09 16 29
17 pages
Python - Regular Expressions
No ratings yet
Python - Regular Expressions
13 pages
Python Regex Guide With Examples
No ratings yet
Python Regex Guide With Examples
4 pages
Full Python Regex Questions Detailed
No ratings yet
Full Python Regex Questions Detailed
4 pages
Regular Expression HOWTO: Guido Van Rossum and The Python Development Team
No ratings yet
Regular Expression HOWTO: Guido Van Rossum and The Python Development Team
18 pages
9 RegEx
No ratings yet
9 RegEx
57 pages
App Dev Using Python-Chapter 3
No ratings yet
App Dev Using Python-Chapter 3
16 pages
Unit-3 - Regular Expression
No ratings yet
Unit-3 - Regular Expression
15 pages
RegEx in Python
No ratings yet
RegEx in Python
6 pages
Python RegEx Guide: Metacharacters & Functions
No ratings yet
Python RegEx Guide: Metacharacters & Functions
104 pages
Regular Expression 01
No ratings yet
Regular Expression 01
48 pages
Python Regular Expressions
No ratings yet
Python Regular Expressions
14 pages
Python Assignment Date: 08-11-2021: Name-Navjeet Kaur Sap ID-500076160 Roll No - R134219065
No ratings yet
Python Assignment Date: 08-11-2021: Name-Navjeet Kaur Sap ID-500076160 Roll No - R134219065
3 pages
Python Notes - Unit 4
No ratings yet
Python Notes - Unit 4
13 pages
9 RegEx
No ratings yet
9 RegEx
57 pages
Regex Guide for Developers
No ratings yet
Regex Guide for Developers
10 pages
Python RegEx
No ratings yet
Python RegEx
11 pages
Python Unit 5
No ratings yet
Python Unit 5
143 pages
Python Regular Expressions Guide
100% (1)
Python Regular Expressions Guide
31 pages
Regular Expression L
No ratings yet
Regular Expression L
20 pages
Python 201 - (Slightly) Advanced Python Topics
No ratings yet
Python 201 - (Slightly) Advanced Python Topics
69 pages
Advanced Python Programming - Lesson No.002
No ratings yet
Advanced Python Programming - Lesson No.002
20 pages
Module5 RegularExpressions
No ratings yet
Module5 RegularExpressions
10 pages
Regular Expressions - Regexes in Python (Part 1) - Real Python
No ratings yet
Regular Expressions - Regexes in Python (Part 1) - Real Python
44 pages
Python Reg Expressions
No ratings yet
Python Reg Expressions
8 pages
Python How To Regex
No ratings yet
Python How To Regex
19 pages
Python 4
No ratings yet
Python 4
128 pages
Final Exam
No ratings yet
Final Exam
12 pages
EPA 2006 Architecture Standard and Guidance
No ratings yet
EPA 2006 Architecture Standard and Guidance
41 pages
2019 Subject Guide PDF
No ratings yet
2019 Subject Guide PDF
190 pages
PyCharm Reference Card
100% (1)
PyCharm Reference Card
2 pages
Types of Charts
100% (2)
Types of Charts
25 pages
Computer Organization and Architecture 8 Edition
No ratings yet
Computer Organization and Architecture 8 Edition
15 pages
RS232 Protocol for C210 Inkjet Printer
No ratings yet
RS232 Protocol for C210 Inkjet Printer
25 pages
As Chapter 3 Hardware
No ratings yet
As Chapter 3 Hardware
60 pages
Fat & SAT Procedure v01
No ratings yet
Fat & SAT Procedure v01
16 pages
O-RAN WG4 CTI-TCP 0-R003-v04 00
No ratings yet
O-RAN WG4 CTI-TCP 0-R003-v04 00
49 pages
HP Probook 650 G8 Notebook PC: Modern Design For The Enterprise
No ratings yet
HP Probook 650 G8 Notebook PC: Modern Design For The Enterprise
4 pages
TIB973 Consys 24.4
No ratings yet
TIB973 Consys 24.4
40 pages
HackLikePro v3
0% (1)
HackLikePro v3
72 pages
Easy Learning Javascript Javascript For Beginners Guide by Yang Hu
No ratings yet
Easy Learning Javascript Javascript For Beginners Guide by Yang Hu
84 pages
Chapter - 02 Number System
No ratings yet
Chapter - 02 Number System
67 pages
Advanced Web Attacks and Exploitation: Figure 20: Burp Suite Repeater Previous Request and Response
No ratings yet
Advanced Web Attacks and Exploitation: Figure 20: Burp Suite Repeater Previous Request and Response
4 pages
Classification of Fingerprint
No ratings yet
Classification of Fingerprint
4 pages
Rience 1-3
No ratings yet
Rience 1-3
22 pages
Beginning JSP 2-From Novice To Professional
No ratings yet
Beginning JSP 2-From Novice To Professional
39 pages
Project Schedule
No ratings yet
Project Schedule
6 pages
CananMertese Resume
No ratings yet
CananMertese Resume
2 pages
Bus Scheduling and Reservation System Abstract
No ratings yet
Bus Scheduling and Reservation System Abstract
13 pages
Lesson 4
No ratings yet
Lesson 4
18 pages
Breakout Board V5 Type English User Manual PDF
No ratings yet
Breakout Board V5 Type English User Manual PDF
14 pages
EEG User Manual for Clinicians
No ratings yet
EEG User Manual for Clinicians
36 pages
Review On Cyber Crime and Security
No ratings yet
Review On Cyber Crime and Security
4 pages
Mini Project Report On:: Arduino Based Samrt Notice Board
No ratings yet
Mini Project Report On:: Arduino Based Samrt Notice Board
12 pages
AFPX-COM5 Ethernet Communication Guide
No ratings yet
AFPX-COM5 Ethernet Communication Guide
34 pages
09 16 Ntref
No ratings yet
09 16 Ntref
49 pages
Module 01: Django Introduction
No ratings yet
Module 01: Django Introduction
3 pages

Regular Expressions in Python

Uploaded by

Regular Expressions in Python

Uploaded by

Regular Expressions in Python

1. Basics of Regular Expressions

A regex defines a pattern to match strings. For example:

• Literal Characters: Match exact characters (e.g., cat matches "cat").

• Metacharacters: Special symbols for patterns (e.g., ., *, +).

To use regex, first import the re module:

Common Functions in the re Module:

1. re.match(): Matches a pattern at the start of a string.

2. re.search(): Searches for a pattern anywhere in the string.

3. re.findall(): Returns all occurrences of a pattern as a list.

4. re.sub(): Replaces occurrences of a pattern with a specified string.

5. re.compile(): Compiles a regex pattern into a reusable object.

Metacharacter Description Example

. Matches any character except newline a.c matches "abc", "adc"

^ Matches start of string ^Hello matches "Hello"

$ Matches end of string world$ matches "world"

* Matches 0 or more occurrences ca*t matches "ct", "cat"

+ Matches 1 or more occurrences ca+t matches "cat" only

? Matches 0 or 1 occurrence ca?t matches "ct", "cat"

{} Matches specific repetitions a{2,3} matches "aa", "aaa"

Sequence Description Example

\d Matches any digit \d+ matches "123"

\D Matches any non-digit \D+ matches "abc"

\w Matches any word character (alphanumeric) \w+ matches "word123"

\W Matches any non-word character \W+ matches "@#$"

\s Matches whitespace \s+ matches spaces

\S Matches non-whitespace \S+ matches "word"

# Example 1: Match at the start

text = "hello world"

result = re.match(r'hello', text)

print(result.group() if result else "No match") # Output: hello

# Example 2: Fails to match in the middle

result = re.match(r'world', text)

print(result) # Output: None

# Example 1: Search anywhere in the string

result = re.search(r'world', text)

print(result.group() if result else "Not found") # Output: world

# Example 1: Find all occurrences

text = "abc 123 def 456"

result = re.findall(r'\d+', text)

print(result) # Output: ['123', '456']

# Example 1: Replace digits with '#'

text = "abc 123 def 456"

result = re.sub(r'\d', '#', text)

print(result) # Output: abc ### def ###

e) Regex with Special Characters

result = re.match(pattern, email)

print(result.group() if result else "Invalid email") # Output: example@example.com

a) Validate a Phone Number

print("Invalid phone number")

b) Extract URLs from Text

text = "Visit https://example.com and http://test.com."

urls = re.findall(pattern, text)

print(urls) # Output: ['https://example.com', 'http://test.com']

6. Compiling Patterns for Efficiency

If a pattern is reused, compile it for better performance:

text = "Numbers: 123, 456, 789"

print(matches) # Output: ['123', '456', '789']

You might also like