find non English and Arabic names

If you are experiencing problems with "Everything", post here for assistance.
Post Reply
2late
Posts: 26
Joined: Sun Feb 01, 2015 11:27 am

find non English and Arabic names

Post by 2late » Thu Oct 26, 2017 6:30 pm

hi everyone
how to find non English and Arabic names using Regex ?
English characters:

Code: Select all

[\u0000-\u007F]+
Arabic characters:

Code: Select all

[\u0600-\u06FF]+

therube
Posts: 2851
Joined: Thu Sep 03, 2009 6:48 pm

Re: find non English and Arabic names

Post by therube » Thu Oct 26, 2017 7:17 pm

non English
(what I call, "non-ascii") regex:[^\x00-\x7f]

regex: How to find €

2late
Posts: 26
Joined: Sun Feb 01, 2015 11:27 am

Re: find non English and Arabic names

Post by 2late » Thu Oct 26, 2017 7:28 pm

thanks but help me to understand, please:
why [^\x00-\x7f] can ignore English but [\x00-\x7f] doesn't only match English but also other languages?

Stamimail
Posts: 760
Joined: Sat Aug 31, 2013 9:05 pm

Re: find non English and Arabic names

Post by Stamimail » Thu Oct 26, 2017 10:05 pm

http://www.utf8-chartable.de/

The key to understand the answer is to enable in Everything:
Options > View > [v] Show highlighted search terms

But currently, this important feature doesn't work (or more right to say - disabled ;) ) for items with RTL text... :(
I hope we will get a fix, as sooner as possible.

But I won't let you wait till then. I also can't see the highlighting, but it seems it is due to "digits" and "space".

void
Site Admin
Posts: 6062
Joined: Fri Oct 16, 2009 11:31 pm

Re: find non English and Arabic names

Post by void » Fri Oct 27, 2017 12:25 am

Search for files not including English characters:
regex:[^\x{0000}-\x{007f}]

Search for files containing Arabic characters:
regex:[\x{0600}-\x{06ff}]

pcrepattern man page

Post Reply