Introduction to Regular Expressions

Regular expressions are a mysterious, extremely useful programming tool. The syntax of Regular Expressions(RegEx) can be extremely daunting for those starting out. Additionally, I've found a lot of RegEx intro guides to be extremely vague, which personally turned me off to learning RegEx for many months. For a while, my "knowledge base" of RegEx consisted of copying and pasting Regular Expression snippets as needed. However, this guide will introduce you to the basics of Regular Expressions in a hopefully extremely accessible manner.

What are Regular Expressions Good For?

Regular Expressions, by definitions, are symbolic patterns that describe text. Regular Expressions serve many extremely useful purposes, including:

Formatting text
Extracting substrings from a string
Finding and replacing characters that are not strictly formatted
Validating form data
Matching text patterns

When it comes to web development, form validation tends to be the primary reason why a developer turns to Regular Expressions. Hopefully this tutorial, and an upcoming tutorial discussing the use of Regular Expressions in your work flow, will shake off the notion that Regex == Form Validation.

Conventions in this Article

For this tutorial, we'll use a generic syntax. Note that languages deal with escape characters and back references differently. Remember that whenever you're applying this knowledge to your language of choice. For your reference:

PHP Regex Syntax
JavaScript Regex Syntax

Thinking Like a RegEx Engine

Regular Expression Syntax

Literal Characters

See img tag example

Escaping Special Characters

[
]
(
)
?
*
+
^
$
\
.
{
}

See hotwire.com example

Character Classes

See example

Ranges in Character Classes

Escape the - with a backslash. (\-)
Have the - at the very end.

See math expression example

Special Characters in Character Classes

\ (escape character)
^ (negation)
- (range)
] (character class end)

See example for escaping special characters

Character Class Negation

See negated character class example

Predefined Character Classes

Predefined character class	Matches
\d	A digit
\w	An alphanumeric character or underscore
\s	A whitespace character
\D	A non digit
\W	A non alphanumeric character or underscore
\S	A non whitespace character

Quantifiers

Quantifier	Description
?	0 or 1 occurrences (optional)
*	0 or more occurrences
+	1 or more occurrences
{x}	Exactly x number of occurrences
{x, y}	Between x and y number of occurrences
{x,}	At least x number of occurrences

piggy piggy piggy
piggypiggypiggy
piggypiggy piggy

903-555-5555

See phone number example

Bar operator

Do not eat the cat
Do not hit the cat
Do not fight the cat
Do not scratch the cat

See the cat example

Resources

RegexPal

Regular-Expressions.info

About the Author:

Joseph is the lead developer of Vert Studios Follow Joseph on Twitter: @Joe_Query
Subscribe to the blog: RSS
Visit Joseph's site: joequery.me

About

Work

Services

Blog

Contact Us

Top Articles ⇒