PostgreSQL String Functions

Last Updated : 11 Oct, 2024

PostgreSQL is a powerful, open-source relational database management system that offers a rich set of functions and operators for working with string data. String manipulation is an essential task in many applications, and PostgreSQL provides a variety of built-in functions to make working with text more efficient and flexible.

In this article, we will explain PostgreSQL string functions, from basic operations like string concatenation to more advanced tasks such as pattern matching and text encoding.

PostgreSQL String Functions

PostgreSQL string functions enable developers to perform various operations on text data, allowing for efficient manipulation, formatting, and querying. These functions range from simple tasks like concatenating and modifying strings to more complex operations like pattern matching and encoding.

1. Basic String Functions

`LENGTH()`

This function returns the total number of characters in a string, including spaces and punctuation. It helps in evaluating the size of text fields in our database.

LENGTH(string)

Example

SELECT LENGTH('PostgreSQL');

Output:

length
10

Explanation:

It calculates the total number of characters in the string 'PostgreSQL'. The function counts all characters, including both uppercase and lowercase letters. The output is 10, which indicates that the string consists of 10 characters in total, from the 'P' in "PostgreSQL" to the 'L'. This includes no spaces or special characters, as the string is continuous

CHAR_LENGTH() or CHARACTER_LENGTH()

This function is same as the LENGTH() function but more descriptive. It also returns the number of characters in a string. It is useful when clarity in function naming is important.

Syntax:

CHAR_LENGTH(string)

Example

SELECT CHAR_LENGTH('PostgreSQL');

Output:

char_length
10

Explanation:

It returns the number of characters in the string 'PostgreSQL'. Similar to the LENGTH() function, it counts each character in the string, including uppercase and lowercase letters. The result is 10, indicating that the string 'PostgreSQL' is made up of 10 characters.

UPPER() and LOWER()

These functions convert a string to uppercase or lowercase, respectively. They're useful for standardizing text for comparison or display.

Syntax:

UPPER(string)
LOWER(string)

Example

SELECT UPPER('PostgreSQL'), LOWER('PostgreSQL');

Output:

upper	lower
POSTGRESQL	postgresql

Explanation:

The UPPER() function transforms all the letters to uppercase, resulting in "POSTGRESQL", while the LOWER() function converts all the characters to lowercase, producing "postgresql". These functions are useful for standardizing text data for comparison or formatting purposes.

INITCAP()

The INITCAP() function capitalizes the first letter of each word in a string while converting the rest to lowercase. It's often used for formatting names or titles.

Syntax:

INITCAP(string)

Example

SELECT INITCAP('hello world');

Output:

initcap
Hello World

Explanation:

It capitalizes the first letter of each word in the string 'hello world', while converting all other letters to lowercase. As a result, the output is "Hello World", where both "Hello" and "World" have their first letters capitalized. This function is often used for formatting names, titles, or sentences where proper capitalization is required.

CONCAT() and || (String Concatenation)

The CONCAT() function is used to join multiple strings together, while the || operator offers a shorthand for concatenation. These are useful for merging fields like first and last names.

Syntax:

CONCAT(string1, string2, ...)

Example 1

SELECT CONCAT('Hello', ' ', 'World');

Output:

concat
Hello World

Explanation:

The CONCAT() function combines these into a single string, resulting in "Hello World". This is useful for merging text or fields, such as combining first and last names or constructing readable messages from multiple parts of text.

Example 2

SELECT 'Hello' || ' ' || 'World' AS greeting;

Output:

greeting
Hello World

Explanation:

The || operator joins these strings together into a single output, "Hello World". The space ensures that "Hello" and "World" are separated by a space. The || operator provides a shorthand way to concatenate strings in PostgreSQL and is widely used for combining text efficiently in queries.

2. String Manipulation Functions

SUBSTRING() (or SUBSTR())

The SUBSTRING() function extracts a portion of a string, starting from a specific position and for a specified length. It is used to retrieve part of a string, such as a substring of a product code.

Syntax:

SUBSTRING(string FROM start FOR length)

Example

SELECT SUBSTRING('PostgreSQL' FROM 1 FOR 4);

Output:

substring
Post

Explanation:

It starts from position 1 (the first character, 'P') and retrieves the next 4 characters, resulting in the substring "Post". This function is useful when we need to extract specific parts of a string, such as prefixes, codes, or abbreviations, based on their position and length within the original string.

LEFT() and RIGHT()

LEFT() retrieves a specified number of characters from the start of a string, while RIGHT() does the same from the end. These are handy for parsing codes or abbreviations.

Syntax:

LEFT(string, number_of_characters)
RIGHT(string, number_of_characters)

Example

SELECT LEFT('PostgreSQL', 4), RIGHT('PostgreSQL', 4);

Output:

left	right
Post	SQL

Explanation:

These functions are particularly useful for extracting specific segments of a string, such as prefixes or suffixes, which can be essential for parsing data or formatting output in queries. The outputs demonstrate how easily we can manipulate string data based on its position within the overall string.

TRIM()

The TRIM() function removes leading, trailing, or both kinds of spaces or specific characters from a string. It’s useful for cleaning up user input or removing unwanted characters.

Syntax:

TRIM([LEADING | TRAILING | BOTH] [characters] FROM string)

Example 1

SELECT TRIM('  PostgreSQL  ');

Output:

trim
PostgreSQL

Explanation:

The output is "PostgreSQL", which contains no extra spaces before or after the text. The TRIM() function is essential for cleaning up user input or data retrieved from databases, ensuring that extra spaces do not affect data integrity or cause issues in comparisons and searches.

Example 2

SELECT TRIM(BOTH '-' FROM '-PostgreSQL-');

Output:

trim
PostgreSQL

Explanation:

The result is "PostgreSQL", which has no leading or trailing hyphens. This function is particularly useful for cleaning up strings that may have been formatted with specific characters, ensuring that the final output is tidy and free of unwanted symbols.

3. String Comparison Functions

POSITION()

The POSITION() function returns the position of the first occurrence of a substring within a string. In this example, the substring 'gres' starts at position 5 in the string 'PostgreSQL'.

Syntax:

POSITION(substring IN string)

Example

SELECT POSITION('gres' IN 'PostgreSQL');

Output:

position
5

Explanation:

This function is useful for determining where a specific substring occurs, which can be helpful in parsing text, validating data, or performing operations based on the location of a substring within a larger string. The position is counted from the beginning of the string, with the first character starting at position 1.

STRPOS()

The STRPOS() function is similar to POSITION() but is more readable. It returns the position of a substring’s first occurrence. It's useful for checking if a substring exists and where it starts.

Syntax:

SELECT STRPOS('PostgreSQL', 'SQL');

Output:

strpos
9

CONCAT_WS()

The CONCAT_WS() function is Similar to CONCAT(), but with a separator between strings. It’s useful when concatenating fields with commas or dashes.

Syntax:

CONCAT_WS(separator, string1, string2, ...)

Example

SELECT CONCAT_WS('-', 'Post', 'greSQL', 'rocks');

Output:

concat_ws
Post-greSQL-rocks

4. Pattern Matching and Regular Expressions

LIKE and ILIKE

These operators are used for simple pattern matching. LIKE is case-sensitive, while ILIKE is case-insensitive, making them ideal for searching and filtering strings.

Syntax:

string LIKE pattern
string ILIKE pattern

Example 1

This checks if the string 'PostgreSQL' starts with 'Post'.

Query:

SELECT 'PostgreSQL' LIKE 'Post%';

Output:

Like
t

Explanation:

The output is t, which stands for true, indicating that the condition is met. The % symbol acts as a wildcard that matches any sequence of characters following 'Post', allowing for flexible pattern matching.

Example 2

SELECT 'postgresql' ILIKE 'POST%';

Output:

ilike
t

Explanation:

The output is t, which indicates true, meaning that the condition is satisfied. The ILIKE operator functions similarly to LIKE, but it performs a case-insensitive comparison, allowing for more flexible matching

SIMILAR TO

The SIMILAR TO operator is used for pattern matching with SQL regular expressions, providing more flexibility than LIKE. It supports complex string matching scenarios.

Syntax:

string SIMILAR TO pattern

Example

SELECT 'PostgreSQL' SIMILAR TO 'Post[[:upper:]]%';

Output:

similar to
t

Explanation:

The output is t, indicating true, which confirms that 'PostgreSQL' follows to the specified pattern. The SIMILAR TO operator allows for more complex pattern matching than LIKE, making it useful for validating formats or structures in strings that require specific character types.

REGEXP_MATCHES()

For more advanced regular expressions, PostgreSQL offers the REGEXP_MATCHES() function, which returns all the matches of a regular expression in a string.

Syntax:

REGEXP_MATCHES(string, pattern)

Example

SELECT REGEXP_MATCHES('PostgreSQL is great', '\w+');

Output:

regexp_matches
{PostgreSQL}

Explanation:

The output {PostgreSQL} indicates that the function found a match for the pattern, returning the first word "PostgreSQL". The regular expression \w+ matches one or more consecutive word characters, making this function particularly useful for extracting words or tokens from text strings

REGEXP_REPLACE()

The REGEXP_REPLACE() function replaces occurrences of a pattern in a string using regular expressions. This replaces all words (\w+) in the string with the word "Database".

Syntax:

REGEXP_REPLACE(string, pattern, replacement)

Example

SELECT REGEXP_REPLACE('PostgreSQL is great', '\w+', 'Database');

Output:

regexp_replace
Database Database Database

Explanation:

The output is Database Database Database, indicating that every word in the original string has been replaced by "Database". The regular expression \w+ matches each word, and the function effectively transforms the entire sentence into a series of the same word. This functionality is useful for data masking, standardization, or modifying text based on specific patterns.

Conclusion

PostgreSQL provides a rich set of string functions and operators that enable developers to manipulate and analyze string data with ease. From basic string functions like LENGTH() and UPPER() to more advanced operations such as regular expression matching and encoding.

Additionally, the ability to easily format, search, and modify strings makes PostgreSQL a powerful tool for developers working with text-heavy applications. By mastering these string functions and operators, users can significantly improve their data processing capabilities and create more efficient queries.

PostgreSQL - Substring Function

kumarsar29u2

Improve

Article Tags :

PostgreSQL String Functions

PostgreSQL String Functions

1. Basic String Functions

LENGTH()

CHAR_LENGTH() or CHARACTER_LENGTH()

UPPER() and LOWER()

INITCAP()

CONCAT() and || (String Concatenation)

2. String Manipulation Functions

SUBSTRING() (or SUBSTR())

LEFT() and RIGHT()

TRIM()

3. String Comparison Functions

POSITION()

STRPOS()

CONCAT_WS()

4. Pattern Matching and Regular Expressions

LIKE and ILIKE

SIMILAR TO

REGEXP_MATCHES()

REGEXP_REPLACE()

Conclusion

Similar Reads

Thank You!

What kind of Experience do you want to share?

`LENGTH()`