Question 1

What is URL encoding (percent encoding)?

Accepted Answer

URL encoding (also called percent encoding, defined in RFC 3986) converts characters that are unsafe or have special meaning in URLs into a percent sign followed by two hexadecimal digits representing the character's byte value. For example, a space becomes %20, an ampersand becomes %26, and non-ASCII characters are first converted to UTF-8 bytes, each percent-encoded individually. This ensures URLs are transmitted correctly across all systems, since URLs can only contain a limited set of ASCII characters.

Question 2

What is the difference between encodeURI() and encodeURIComponent()?

Accepted Answer

encodeURI() encodes a complete URI, leaving reserved characters like : / ? # and & untouched since they have structural meaning in URLs. encodeURIComponent() encodes everything except unreserved characters (letters, digits, -, _, ., ~), making it suitable for encoding individual query parameter values where characters like & and = should be treated as literal text. Use encodeURI() for full URLs and encodeURIComponent() for parameter values. Using the wrong one is a common source of bugs.

Question 3

Which characters need to be URL-encoded?

Accepted Answer

Characters that must be encoded include spaces (as %20 or + in form data), all non-ASCII characters (UTF-8 bytes encoded individually), and reserved characters when used outside their designated structural purpose. Reserved characters include : / ? # [ ] @ ! $ & ' ( ) * + , ; =. Unreserved characters that never need encoding are uppercase and lowercase letters (A-Z, a-z), digits (0-9), and four symbols: hyphen (-), underscore (_), period (.), and tilde (~).

Question 4

Why does my URL have %20 or + for spaces?

Accepted Answer

Both %20 and + represent spaces in URLs, but in different contexts. %20 is the standard RFC 3986 percent-encoding for a space character and works everywhere in a URL (scheme, path, query, fragment). The + sign represents a space only in the application/x-www-form-urlencoded format used by HTML form submissions, and only within the query string. JavaScript's encodeURIComponent() produces %20, while URLSearchParams and HTML forms produce +. For maximum compatibility, %20 is safer.

Question 5

What happens if I double-encode a URL?

Accepted Answer

Double encoding occurs when an already-encoded string is encoded again, converting percent signs into %25. For example, a space encoded once becomes %20, but encoded twice becomes %2520 (the % is encoded to %25, followed by 20). This is a common bug in web applications that causes broken links, failed API requests, and garbled text. To avoid it, only encode raw user input once, and always decode before re-encoding. If you receive a URL with %25 sequences, it has likely been double-encoded and needs only one round of decoding.

Question 6

How does URL encoding handle non-English characters like Chinese or Arabic?

Accepted Answer

Non-ASCII characters are first converted to their UTF-8 byte representation, then each byte is percent-encoded individually. For example, the Chinese character for sun (U+65E5) has the UTF-8 bytes E6, 97, A5, so it becomes %E6%97%A5. A single emoji like the smiley face (U+1F600) uses four UTF-8 bytes and becomes %F0%9F%98%80. Modern browsers display the original characters in the address bar using Internationalized Resource Identifiers (IRIs), while the actual HTTP request uses the percent-encoded form. This system allows URLs to contain text in any language.

Category	Characters
Uppercase letters	A B C D E F G H I J K L M N O P Q R S T U V W X Y Z
Lowercase letters	a b c d e f g h i j k l m n o p q r s t u v w x y z
Digits	0 1 2 3 4 5 6 7 8 9
Special safe chars	- _ . ~

Character	Encoded	URL Purpose
:	%3A	Scheme separator (http:), port (:8080)
/	%2F	Path separator
?	%3F	Query string start
#	%23	Fragment identifier
&	%26	Query parameter separator
=	%3D	Key-value separator in query
@	%40	User info separator
+	%2B	Space in form data (legacy)
(space)	%20	Not allowed in URLs

Language	Encode Function	Decode Function	Notes
JavaScript	`encodeURIComponent()`	`decodeURIComponent()`	Also: encodeURI() for full URLs
Python	`urllib.parse.quote()`	`urllib.parse.unquote()`	quote_plus() for form data
PHP	`rawurlencode()`	`rawurldecode()`	urlencode() uses + for spaces
Java	`URLEncoder.encode()`	`URLDecoder.decode()`	Uses + for spaces (legacy)
Go	`url.QueryEscape()`	`url.QueryUnescape()`	PathEscape for path segments
C#	`Uri.EscapeDataString()`	`Uri.UnescapeDataString()`	Avoid HttpUtility.UrlEncode for RFC 3986

URL Encoder & Decoder

URL Encoding Explained: Percent Encoding, Reserved Characters, and the encodeURI vs encodeURIComponent Distinction

URL Character Categories: Reserved, Unreserved, and Unsafe

Unreserved Characters (never need encoding)

Reserved Characters (have special meaning in URLs)

encodeURI() vs encodeURIComponent(): The Critical Difference

How Percent Encoding Works with Unicode (UTF-8)

Spaces in URLs: %20 vs + (Plus Sign)

When to Encode: Practical Scenarios

URL Encoding in Different Languages

Common URL Encoding Mistakes

How This Tool Works

Frequently Asked Questions

Related Developer Tools