MongoDB - Regular Expression


Advertisements

Regular Expressions are frequently used in all languages to search for a pattern or word in any string. MongoDB also provides functionality of regular expression for string pattern matching using the $regex operator. MongoDB uses PCRE (Perl Compatible Regular Expression) as regular expression language.

Unlike text search, we do not need to do any configuration or command to use regular expressions.

Assume we have inserted a document in a database named posts as shown below −

> db.posts.insert(
{
   "post_text": "enjoy the mongodb articles on howcodex",
   "tags": [
      "mongodb",
      "howcodex"
   ]
}
WriteResult({ "nInserted" : 1 })

Using regex Expression

The following regex query searches for all the posts containing string howcodex in it −

> db.posts.find({post_text:{$regex:"howcodex"}}).pretty()
{
	"_id" : ObjectId("5dd7ce28f1dd4583e7103fe0"),
	"post_text" : "enjoy the mongodb articles on howcodex",
	"tags" : [
		"mongodb",
		"howcodex"
	]
}
{
	"_id" : ObjectId("5dd7d111f1dd4583e7103fe2"),
	"post_text" : "enjoy the mongodb articles on howcodex",
	"tags" : [
		"mongodb",
		"howcodex"
	]
}
>

The same query can also be written as −

>db.posts.find({post_text:/howcodex/})

Using regex Expression with Case Insensitive

To make the search case insensitive, we use the $options parameter with value $i. The following command will look for strings having the word howcodex, irrespective of smaller or capital case −

>db.posts.find({post_text:{$regex:"howcodex",$options:"$i"}})

One of the results returned from this query is the following document which contains the word howcodex in different cases −

{
   "_id" : ObjectId("53493d37d852429c10000004"),
   "post_text" : "hey! this is my post on Howcodex", 
   "tags" : [ "howcodex" ]
} 
 

Using regex for Array Elements

We can also use the concept of regex on array field. This is particularly very important when we implement the functionality of tags. So, if you want to search for all the posts having tags beginning from the word tutorial (either tutorial or tutorials or tutorialpoint or tutorialphp), you can use the following code −

>db.posts.find({tags:{$regex:"tutorial"}})

Optimizing Regular Expression Queries

  • If the document fields are indexed, the query will use make use of indexed values to match the regular expression. This makes the search very fast as compared to the regular expression scanning the whole collection.

  • If the regular expression is a prefix expression, all the matches are meant to start with a certain string characters. For e.g., if the regex expression is ^tut, then the query has to search for only those strings that begin with tut.

Advertisements