Regular expression in bash not working

Question

Is there any way in bash so that I can match the patter like that

[0-9]{8}.*.jpg

I have written the above for the following pattern match "First 8 character should be digit and rest of them would be anything and end with .jpg" but the above is not working. if I write in the below manner it's working

[0-9][0-9][0-9][0-9][0-9][0-9][0-9][0-9].*.jpg

Now suppose I want first 20 character must be digit should I repeat the [0-9] 20 times.. I think there is a better solution available which i don't know...

If anyone know please help....

Have you tried =~? And the regex can be \d{8}.*\.jpg. See bash regex match — RedX
– RedX, Commented Sep 2, 2014 at 11:16
How are you trying to use it? Keep in mind that bash only recognizes regular expressions following the =~ operator in a [[ ... ]] command; elsewhere your pattern is treated as a filename glob. — chepner
– chepner, Commented Sep 2, 2014 at 14:18
@CharlesDuffy yeah I studied it a little but then discarded because find together with regex seems to suffice. Now I see jm666 covers it with his answer. — fedorqui
– fedorqui, Commented Sep 2, 2014 at 15:31

Community · Accepted Answer · 2017-05-23 11:57:15Z

3

You can use the regex in find:

find test -regextype posix-extended -regex "^[0-9]{8}.*.jpg$"

Test

$ touch test/12345678aaa.jpg
$ touch test/1234567aaa.jpg

$ find test -regextype posix-extended -regex ".*/[0-9]{8}.*"
test/12345678aaa.jpg

And if it is related to the previous question, you can use:

for file in $(find test -regextype posix-extended -regex ".*/[0-9]{8}.*")
do
   echo "my file is $file"
done

If you create directories and files in them, more matchings can appear:

$ mkdir test/123456789.dir
$ touch test/123456789.dir/1234567890.jpg

You can filter by -type f, so that you just get files:

$ find test -type f -regextype posix-extended -regex ".*/[0-9]{8}.*"
test/12345678aaa.jpg
test/123456789.dir/1234567890.jpg

And/or specify the depth of the find, so that it does not contain subdirectories:

$ find test -maxdepth 1 -type f -regextype posix-extended -regex ".*/[0-9]{8}.*"
test/12345678aaa.jpg

edited May 23, 2017 at 11:57

CommunityBot

11 silver badge

answered Sep 2, 2014 at 11:21

fedorqui

294k113 gold badges593 silver badges641 bronze badges

Sign up to request clarification or add additional context in comments.

Comments

John Zwinck · Accepted Answer · 2014-09-02 11:24:24Z

2

It looks like you're trying to generate a list of filenames from a regular expression. You can do that, but not directly from Bash as far as I know. Instead, use find:

find -E . -regex '.*/[0-9]{8}.*\.jpg' -depth 1

Something like that works on my Mac OS X system; on Linux the . for current directory is optional, or you can specify a different directory to search in. I added -depth 1 to avoid descending into subdirectories.

answered Sep 2, 2014 at 11:24

John Zwinck

252k44 gold badges347 silver badges459 bronze badges

3 Comments

Charles Duffy Over a year ago

@JatinKhurana, it escapes the literal . in .jpg, which otherwise matches any character. You could leave it out if writing the extension as [.]jpg, which is how I would do it.

Jatin Khurana Over a year ago

sorry for late reply(was busy in my exam)... but I was asking for the first one */[0-9]... in the previous ....

John Zwinck Over a year ago

.*/ matches the directory part of the paths.

clt60 · Accepted Answer · 2014-09-02 15:46:31Z

A bit late answer.

Bash's filename exapnsion patterns ( called globbing ) has it's own rules. They're exists in two forms:

simple globbing
extended globbing (if you have enabled shopts -s extglob

You can read about the both rules for example here. (3.5.8.1 Pattern Matching)

You should remember, the globbing rules aren't the traditional regular expressions (as you probably know for grep or sed and such), and especially they're not the perl's (extended) regular expressions.

So, if you want use filename expansion (aka globbing) you're stuck with the above two (simple/extended) pattern rules. Of course, bash knows regular expressions, but not for filename-expansion (globbing).

So, you can for example do the next:

shopt -s globstar        #if you haven't already enabled - for the ** expansion
regex="[0-9]{8}.*\.jpg"
for file in ./**/*.jpg   #will match all *.jpg recusrively (globstar)
do
    #try the regex matching
    [[ $file =~ $regex ]] || continue  #didn't match

    #matched! - do something with the file
    echo "the $file has at least 8 digits"
done

or you can use, the find command with the built-in regex matching rules (see other answers), or the grep with perl-like regexes, such:

find somewhere -type f -name \*.jpg -maxdepth 1 -print0 | grep -zP '/\d{8}.*.jpg'

The speed: for the large trees the find is faster. At least on my notebook, where:

while IFS= read -d $'\0' -r file
do
    echo "$file"
done < <(find ~/Pictures -name \*.JPG -print0 | grep -zP 'P\d{4}.*\.JPG')

runs real 0m1.593s, and the

regex="P[0-9]{4}.*\.JPG"
for file in ~/Pictures/**/*.JPG
do
    [[ $file =~ $regex ]] || continue  #didn't match

    echo "$file"
done

runs real 0m3.628s seconds.

On the small trees, IMHO is better to use the builting bash regexes. (maybe, I prefer it because i like the ./**/*.ext expansion, and got all filenames correctly inside the variable, regardless of spaces and like, without the care about the -print0 and read -d $'\0; and such...)

Very very good approach, with good explanation, references and comparisons! Picky question: what do you prefer? I like the way find handles it and I would go for it in case it is just a matter of listing files.

Collectives™ on Stack Overflow

Regular expression in bash not working

3 Answers 3

Test

Comments

3 Comments

1 Comment

Your Answer

Linked

Hot Network Questions

Collectives™ on Stack Overflow

3 Answers 3

Test

Comments

3 Comments

1 Comment

Your Answer

Sign up or log in

Post as a guest

Linked

Related