What is the best regex for email validation?

For most apps, use the WHATWG/HTML5 pattern that browsers apply to . It accepts every address real users have while rejecting the obvious garbage. Going stricter mainly causes false rejections of valid addresses, so reserve tighter patterns for cases where you control the allowed domains.

Why is the official RFC 5322 email regex so long?

RFC 5322 permits quoted strings, comments, and nested whitespace, so a fully compliant regex balloons past 6,000 characters. It still cannot confirm a mailbox exists, so the effort buys almost nothing in practice. Most teams use the shorter HTML5 pattern instead.

Can regex fully validate an email address?

No. Regex only checks syntax. It cannot tell you whether the domain has an MX record or whether the mailbox accepts mail. The only reliable confirmation is sending a verification email and waiting for the user to click the link inside it.

Does the HTML5 email pattern allow plus signs and dots?

Yes. The local part permits . ! # $ % & ' * + / = ? ^ _ ` { | } ~ and hyphens, so addresses like user+tag@example.com and first.last@example.com pass. This is why tag-based filtering with the plus sign works on most mail providers.

Email Regex Patterns Compared: Strict vs Loose

Every developer writes an email regex eventually, copies a scary-looking one off a forum, and never questions whether it actually works. Most of them quietly reject valid addresses or accept obvious junk. This post compares the strict and loose email regex patterns side by side, explains why the famous "RFC 5322 official" regex is a trap, and gives you one pattern you can paste in and trust.

TL;DR

Use the WHATWG/HTML5 pattern as your default email regex — it matches real addresses and is short.
The "official RFC 5322 regex" is over 6,000 characters and still proves nothing useful.
Strict patterns mostly cause false rejections of valid addresses like [email protected].
Regex checks syntax only; it can't confirm a mailbox exists.
Always finish validation by sending a confirmation email.

Why the RFC 5322 official email regex is useless

The internet loves to share a monstrous "RFC 5322 compliant" email regex as if length equals correctness. It doesn't. Understanding why means looking at what the underlying specs actually allow.

What the spec actually permits

Email syntax is defined by RFC 5322, the Internet Message Format. That grammar is far more permissive than most people expect. It allows quoted local parts, embedded comments in parentheses, and folding whitespace. All of the following are technically valid:

"[email protected]"@example.com
much."more\ unusual"@example.com
user(this is a comment)@example.com

A regex that faithfully matches every legal RFC 5322 form runs past 6,000 characters. It accepts addresses no real mail provider would ever issue, and it still cannot tell you whether the address is deliverable.

Too strict and too lax at the same time

The HTML specification is blunt about this. It calls RFC 5322's definition "too strict (before the @ character), too vague (after the @ character), and too lax (allowing comments, whitespace characters, and quoted strings in manners unfamiliar to most users)." That is the rare case where the formal standard is simultaneously wrong in both directions for practical use.

Length limits regex usually ignores

Syntax aside, RFC 5321 (SMTP) caps the real lengths: the local part may not exceed 64 octets, the domain may not exceed 255 octets, and the whole address is limited to 254 octets in transport. A typical email regex checks none of this. If length matters to you, enforce it with a simple .length check rather than trying to express octet counts inside the pattern.

Email regex patterns compared: strict vs loose

There are really only two camps: a permissive pattern that errs toward accepting addresses, and a stricter one that locks down the local part. Here are both, with the trade-offs spelled out.

The loose pattern (accept almost everything)

The loose approach checks for the basic shape — something, an @, a domain with at least one dot — and trusts the mail server to reject the rest:

/^[^\s@]+@[^\s@]+\.[^\s@]+$/

This rejects empty strings, spaces, and missing @ signs, which catches the overwhelming majority of typos. It will happily accept [email protected], but that is a deliberate choice: you would rather let an odd-looking address through than block a real user.

The strict pattern (lock down the local part)

The stricter, production-grade choice is the pattern browsers actually use for <input type="email">. The HTML Standard publishes it as a deliberate "willful violation" of RFC 5322 — narrower than the spec, but matched to reality:

/^[a-zA-Z0-9.!#$%&'*+/=?^_`{|}~-]+
@[a-zA-Z0-9](?:[a-zA-Z0-9-]{0,61}
[a-zA-Z0-9])?(?:\.[a-zA-Z0-9]
(?:[a-zA-Z0-9-]{0,61}[a-zA-Z0-9])?)*$/

(Written across lines for readability — it is one pattern with no line breaks.) It allows every character the local part legitimately uses, enforces the 63-character DNS label limit on each domain segment, and rejects the quoted-string and comment forms nobody types. This is the same logic MDN documents for the email input type.

Strict vs loose at a glance

Pattern	Best for
Loose `[^\s@]+@[^\s@]+\.[^\s@]+`	Quick checks, forms, catching typos
HTML5 / WHATWG	Production validation, sign-up flows
RFC 5322 "full"	Almost never — academic interest only

A second table for the behavior that trips people up:

Address	Loose	HTML5
`[email protected]`	pass	pass
`[email protected]`	pass	pass
`user@localhost`	pass	fail
`"quoted"@example.com`	fail	fail

What email regex should I actually use?

If you take one thing away: default to the HTML5/WHATWG pattern and stop there unless you have a concrete reason not to.

Use the HTML5 pattern as your default

You often don't need any custom regex at all. Marking up a field as <input type="email" required> makes the browser apply the exact pattern above for free, with built-in error messaging and accessibility. When you do need it in JavaScript, reuse the same pattern so client and server agree. Paste it into iKit's Regex Tester to confirm it behaves before you ship it.

When to go stricter

Tighten the pattern only when you genuinely control the input domain — for example, an internal tool that should accept only @yourcompany.com addresses. In that case the restriction is a business rule, not email validation, and it belongs in a focused pattern like ^[a-z0-9.]+@yourcompany\.com$. Don't bolt arbitrary strictness onto a public sign-up form; you will reject real customers.

Don't rely on regex alone

Regex validates syntax, never existence. A perfectly formed address can point at a domain with no mail server or a mailbox that bounces everything. The only authoritative check is to send a confirmation email with a unique link and treat the address as unverified until the user clicks it. Some teams add an MX-record DNS lookup as a cheap pre-filter, but confirmation is the part that actually matters.

How to test an email regex without false positives

A pattern is only as good as the cases you throw at it. Testing is where most "working" regexes fall apart.

Build a test corpus of valid and invalid addresses

Before trusting any pattern, run it against a fixed list that mixes the easy cases with the awkward ones:

[email protected] — must pass
[email protected] — must pass
[email protected] — must pass (plus-addressing)
name@localhost — should fail for public sign-ups
two@@example.com — must fail
spaces [email protected] — must fail

Keep the list in version control and re-run it whenever someone "improves" the regex. When you tweak a pattern and want to see exactly which lines changed behavior, you can paste the before/after match output into the Diff Checker and compare side by side.

Watch for catastrophic backtracking

Patterns with nested quantifiers like (a+)+ can blow up exponentially on certain inputs — a denial-of-service vector known as ReDoS. The WHATWG pattern is safe because its quantifiers don't overlap, but if you write your own, test a long pathological string (say, 50 repeated characters with no @) and make sure matching stays instant. The Regex Tester will hang visibly if your pattern backtracks badly, which is a useful early warning.

Internationalized (Unicode) email addresses

Modern email allows non-ASCII addresses such as 用户@例子.公司 under the SMTPUTF8 extension. The classic ASCII patterns above reject these by design. If your audience needs them, accept Unicode in the local part with a broader character class and rely on confirmation email for the real check — most systems still normalize or punycode the domain before sending.

One practical wrinkle: once you accept an address, you often need to put it somewhere else — a mailto: link, a query string, or a webhook payload. Unicode and even ordinary characters like + have to be percent-encoded in those contexts, or the address breaks downstream. If you're embedding a validated address into a URL, run it through the URL Encoder first so the @, +, and any non-ASCII bytes survive transport intact.

A quick checklist before you ship

Before any email regex goes to production, confirm three things. First, the pattern accepts plus-addressing and dotted local parts — those are the addresses real users type. Second, it fails fast on garbage rather than backtracking. Third, there is a confirmation-email step behind it, because no pattern can prove a mailbox exists. Get those three right and the exact regex you chose barely matters; get them wrong and the cleverest pattern in the world still ships broken sign-ups.

References

RFC 5322 — Internet Message Format — the grammar that defines legal email syntax; cited for quoted strings, comments, and why the "full" regex is enormous.
RFC 5321 — Simple Mail Transfer Protocol — source for the 64 / 255 / 254-octet length limits.
HTML Standard — The input element — the canonical WHATWG email pattern and its "willful violation" rationale.
MDN: <input type="email"> — browser validation behavior and the pattern attribute.

Related on iKit

The 25 regex patterns worth memorizing for everyday work — the email pattern fits right into this everyday toolkit of character classes, anchors, and quantifiers.
Test any regex pattern online in about 30 seconds — the step-by-step workflow for validating the strict and loose patterns above against your own test corpus.

Email Regex Patterns Compared: Strict vs Loose (2026)

Email Regex Patterns Compared: Strict vs Loose

TL;DR

Why the RFC 5322 official email regex is useless

What the spec actually permits

Too strict and too lax at the same time

Length limits regex usually ignores

Email regex patterns compared: strict vs loose

The loose pattern (accept almost everything)

The strict pattern (lock down the local part)

Strict vs loose at a glance

What email regex should I actually use?

Use the HTML5 pattern as your default

When to go stricter

Don't rely on regex alone

How to test an email regex without false positives

Build a test corpus of valid and invalid addresses

Watch for catastrophic backtracking

Internationalized (Unicode) email addresses

A quick checklist before you ship

References

Related on iKit

Related posts

Image Converter vs Photoshop Save As: When Browser Wins (2026)

WebP vs AVIF Encoding: Browser Support, Size & Speed (2026)

Postgres extract(epoch) vs MySQL UNIX_TIMESTAMP() (2026)

Email Regex Patterns Compared: Strict vs Loose

#TL;DR

#Why the RFC 5322 official email regex is useless

#What the spec actually permits

#Too strict and too lax at the same time

#Length limits regex usually ignores

#Email regex patterns compared: strict vs loose

#The loose pattern (accept almost everything)

#The strict pattern (lock down the local part)

#Strict vs loose at a glance

#What email regex should I actually use?

#Use the HTML5 pattern as your default

#When to go stricter

#Don't rely on regex alone

#How to test an email regex without false positives

#Build a test corpus of valid and invalid addresses

#Watch for catastrophic backtracking

#Internationalized (Unicode) email addresses

#A quick checklist before you ship

#References

#Related on iKit

Related posts

Image Converter vs Photoshop Save As: When Browser Wins (2026)

WebP vs AVIF Encoding: Browser Support, Size & Speed (2026)

Postgres extract(epoch) vs MySQL UNIX_TIMESTAMP() (2026)

TL;DR

Why the RFC 5322 official email regex is useless

What the spec actually permits

Too strict and too lax at the same time

Length limits regex usually ignores

Email regex patterns compared: strict vs loose

The loose pattern (accept almost everything)

The strict pattern (lock down the local part)

Strict vs loose at a glance

What email regex should I actually use?

Use the HTML5 pattern as your default

When to go stricter

Don't rely on regex alone

How to test an email regex without false positives

Build a test corpus of valid and invalid addresses

Watch for catastrophic backtracking

Internationalized (Unicode) email addresses

A quick checklist before you ship

References

Related on iKit