SOLVED: How to find: Blank Char / Invisible Character AND Hidde Char

General discussion related to "Everything".
Debugger
Posts: 719
Joined: Thu Jan 26, 2017 11:56 am

SOLVED: How to find: Blank Char / Invisible Character AND Hidde Char

Post by Debugger »

This document contains characters... (Current Encoding: 1250).


How to find/DETECT a strange character?
This is strange for me because it is a simple text in Polish, so the 1250 encoding is 100% correct.
Last edited by Debugger on Tue Mar 26, 2019 12:06 pm, edited 3 times in total.
void
Developer
Posts: 19899
Joined: Fri Oct 16, 2009 11:31 pm

Re: How to find a strange character?

Post by void »

To search for any character with PCRE regex not in code page 1250:

Code: Select all

[^\x{0000}\x{0001}\x{0002}\x{0003}\x{0004}\x{0005}\x{0006}\x{0007}\x{0008}\x{0009}\x{000a}\x{000b}\x{000c}\x{000d}\x{000e}\x{000f}\x{0010}\x{0011}\x{0012}\x{0013}\x{0014}\x{0015}\x{0016}\x{0017}\x{0018}\x{0019}\x{001a}\x{001b}\x{001c}\x{001d}\x{001e}\x{001f}\x{0020}\x{0021}\x{0022}\x{0023}\x{0024}\x{0025}\x{0026}\x{0027}\x{0028}\x{0029}\x{002A}\x{002B}\x{002C}\x{002D}\x{002E}\x{002F}\x{030}\x{0031}\x{0032}\x{0033}\x{0034}\x{0035}\x{0036}\x{0037}\x{0038}\x{0039}\x{003A}\x{003B}\x{003C}\x{003D}\x{003E}\x{003F}\x{040}\x{0041}\x{0042}\x{0043}\x{0044}\x{0045}\x{0046}\x{0047}\x{0048}\x{0049}\x{004A}\x{004B}\x{004C}\x{004D}\x{004E}\x{004F}\x{050}\x{0051}\x{0052}\x{0053}\x{0054}\x{0055}\x{0056}\x{0057}\x{0058}\x{0059}\x{005A}\x{005B}\x{005C}\x{005D}\x{005E}\x{005F}\x{060}\x{0061}\x{0062}\x{0063}\x{0064}\x{0065}\x{0066}\x{0067}\x{0068}\x{0069}\x{006A}\x{006B}\x{006C}\x{006D}\x{006E}\x{006F}\x{070}\x{0071}\x{0072}\x{0073}\x{0074}\x{0075}\x{0076}\x{0077}\x{0078}\x{0079}\x{007A}\x{007B}\x{007C}\x{007D}\x{007E}\x{007F}\x{0AC}\x{201A}\x{201E}\x{2026}\x{2020}\x{2021}\x{2030}\x{0160}\x{2039}\x{015A}\x{0164}\x{017D}\x{0179}\x{018}\x{2019}\x{201C}\x{201D}\x{2022}\x{2013}\x{2014}\x{2122}\x{0161}\x{203A}\x{015B}\x{0165}\x{017E}\x{017A}\x{0A0}\x{02C7}\x{02D8}\x{0141}\x{00A4}\x{0104}\x{00A6}\x{00A7}\x{00A8}\x{00A9}\x{015E}\x{00AB}\x{00AC}\x{00AD}\x{00AE}\x{017B}\x{0B0}\x{00B1}\x{02DB}\x{0142}\x{00B4}\x{00B5}\x{00B6}\x{00B7}\x{00B8}\x{0105}\x{015F}\x{00BB}\x{013D}\x{02DD}\x{013E}\x{017C}\x{154}\x{00C1}\x{00C2}\x{0102}\x{00C4}\x{0139}\x{0106}\x{00C7}\x{010C}\x{00C9}\x{0118}\x{00CB}\x{011A}\x{00CD}\x{00CE}\x{010E}\x{110}\x{0143}\x{0147}\x{00D3}\x{00D4}\x{0150}\x{00D6}\x{00D7}\x{0158}\x{016E}\x{00DA}\x{0170}\x{00DC}\x{00DD}\x{0162}\x{00DF}\x{155}\x{00E1}\x{00E2}\x{0103}\x{00E4}\x{013A}\x{0107}\x{00E7}\x{010D}\x{00E9}\x{0119}\x{00EB}\x{011B}\x{00ED}\x{00EE}\x{010F}\x{111}\x{0144}\x{0148}\x{00F3}\x{00F4}\x{0151}\x{00F6}\x{00F7}\x{0159}\x{016F}\x{00FA}\x{0171}\x{00FC}\x{00FD}\x{0163}\x{02D9}]
Debugger
Posts: 719
Joined: Thu Jan 26, 2017 11:56 am

Re: How to find a strange character?

Post by Debugger »

He finds nothing.
I tested in Notepad++ and EmEditor.

Image





Image
tuska
Posts: 1385
Joined: Thu Jul 13, 2017 9:14 am

Re: How to find a strange character?

Post by tuska »

Debugger wrote: Sat Mar 16, 2019 7:03 am He finds nothing.
I tested in ... EmEditor.
Any character with PCRE regex not in code page 1250.png
You do not have the required permissions to view the files attached to this post.
Debugger
Posts: 719
Joined: Thu Jan 26, 2017 11:56 am

Re: How to find a strange character?

Post by Debugger »

Your screen presents a differently configured Emeditor interface, or a version other than the latest version of EmEditor (Pro version)

A strange symbol can also be something that can not be seen.

I have no idea why can not save text with the default encoding.
Which symbol, unicode, special character, it makes it impossible and how you save eg without UTF-8 you will get the name

Code: Select all

??
tuska
Posts: 1385
Joined: Thu Jul 13, 2017 9:14 am

Re: How to find a strange character?

Post by tuska »

Debugger wrote:Your screen presents a differently configured Emeditor interface, or a version other than the latest version of EmEditor (Pro version)
That's just the find toolbar (menu "View" - "Toolbars"...) - I use EmEditor Pro Version 18.6.91 x64.

If I want to save a file with special characters (as shown in the picture) as .txt file,
then "Save as Unicode (UTF-16LE with signature)" is automatically suggested to me...
The storage in this format allows me to open the text file without any problems and later on
saving without any further request.
Debugger
Posts: 719
Joined: Thu Jan 26, 2017 11:56 am

Re: How to find a strange character?

Post by Debugger »

I understand. Filter Toolbar
but he still does not detect the strange char
I want to save as 1250 - for the Polish text. I do not need any UTF-16LE.
I also tried online tools to detect strange characters, but they also can not detect strange char.


Detects the Polish text, but does not detect strange characters.
[a-żA-Ż ?,:]
tuska
Posts: 1385
Joined: Thu Jul 13, 2017 9:14 am

Re: How to find a strange character?

Post by tuska »

Debugger wrote:I understand. Filter Toolbar
No, I meant the "Find Toolbar"! -> Menu "View" - "Toolbars" - "Find Toolbar"

Did you paste the code in the field "Find" of the "Find Toolbar" and
- clicked on button "Use Regular Expressions" in the "Find Toolbar" and
- then clicked on "Find Next" in the "Find Toolbar"?
I then get the special characters marked as shown.
Debugger
Posts: 719
Joined: Thu Jan 26, 2017 11:56 am

Re: How to find a strange character?

Post by Debugger »

But in my text there are no special characters you mentioned.

I have checked thousands of different Unicode, and still do not detect a strange or illegal symbol.

[^\x00-\x7F]+