SOLVED: How to find: Blank Char / Invisible Character AND Hidden Char

Debugger
Posts: 489
Joined: Thu Jan 26, 2017 11:56 am

SOLVED: How to find: Blank Char / Invisible Character AND Hidden Char

Post by Debugger » Fri Mar 15, 2019 12:31 pm

This document contains characters... (Current Encoding: 1250).


How can I find/detect a strange character?
This is odd to me, because it is simple Polish text, so encoding 1250 should be 100% correct.
Last edited by Debugger on Tue Mar 26, 2019 12:06 pm, edited 3 times in total.

void
Site Admin
Posts: 4666
Joined: Fri Oct 16, 2009 11:31 pm

Re: How to find a strange character?

Post by void » Sat Mar 16, 2019 2:25 am

To search with a PCRE regex for any character that is not in code page 1250:

[^\x{0000}\x{0001}\x{0002}\x{0003}\x{0004}\x{0005}\x{0006}\x{0007}\x{0008}\x{0009}\x{000a}\x{000b}\x{000c}\x{000d}\x{000e}\x{000f}\x{0010}\x{0011}\x{0012}\x{0013}\x{0014}\x{0015}\x{0016}\x{0017}\x{0018}\x{0019}\x{001a}\x{001b}\x{001c}\x{001d}\x{001e}\x{001f}\x{0020}\x{0021}\x{0022}\x{0023}\x{0024}\x{0025}\x{0026}\x{0027}\x{0028}\x{0029}\x{002A}\x{002B}\x{002C}\x{002D}\x{002E}\x{002F}\x{0030}\x{0031}\x{0032}\x{0033}\x{0034}\x{0035}\x{0036}\x{0037}\x{0038}\x{0039}\x{003A}\x{003B}\x{003C}\x{003D}\x{003E}\x{003F}\x{0040}\x{0041}\x{0042}\x{0043}\x{0044}\x{0045}\x{0046}\x{0047}\x{0048}\x{0049}\x{004A}\x{004B}\x{004C}\x{004D}\x{004E}\x{004F}\x{0050}\x{0051}\x{0052}\x{0053}\x{0054}\x{0055}\x{0056}\x{0057}\x{0058}\x{0059}\x{005A}\x{005B}\x{005C}\x{005D}\x{005E}\x{005F}\x{0060}\x{0061}\x{0062}\x{0063}\x{0064}\x{0065}\x{0066}\x{0067}\x{0068}\x{0069}\x{006A}\x{006B}\x{006C}\x{006D}\x{006E}\x{006F}\x{0070}\x{0071}\x{0072}\x{0073}\x{0074}\x{0075}\x{0076}\x{0077}\x{0078}\x{0079}\x{007A}\x{007B}\x{007C}\x{007D}\x{007E}\x{007F}\x{20AC}\x{201A}\x{201E}\x{2026}\x{2020}\x{2021}\x{2030}\x{0160}\x{2039}\x{015A}\x{0164}\x{017D}\x{0179}\x{2018}\x{2019}\x{201C}\x{201D}\x{2022}\x{2013}\x{2014}\x{2122}\x{0161}\x{203A}\x{015B}\x{0165}\x{017E}\x{017A}\x{00A0}\x{02C7}\x{02D8}\x{0141}\x{00A4}\x{0104}\x{00A6}\x{00A7}\x{00A8}\x{00A9}\x{015E}\x{00AB}\x{00AC}\x{00AD}\x{00AE}\x{017B}\x{00B0}\x{00B1}\x{02DB}\x{0142}\x{00B4}\x{00B5}\x{00B6}\x{00B7}\x{00B8}\x{0105}\x{015F}\x{00BB}\x{013D}\x{02DD}\x{013E}\x{017C}\x{0154}\x{00C1}\x{00C2}\x{0102}\x{00C4}\x{0139}\x{0106}\x{00C7}\x{010C}\x{00C9}\x{0118}\x{00CB}\x{011A}\x{00CD}\x{00CE}\x{010E}\x{0110}\x{0143}\x{0147}\x{00D3}\x{00D4}\x{0150}\x{00D6}\x{00D7}\x{0158}\x{016E}\x{00DA}\x{0170}\x{00DC}\x{00DD}\x{0162}\x{00DF}\x{0155}\x{00E1}\x{00E2}\x{0103}\x{00E4}\x{013A}\x{0107}\x{00E7}\x{010D}\x{00E9}\x{0119}\x{00EB}\x{011B}\x{00ED}\x{00EE}\x{010F}\x{0111}\x{0144}\x{0148}\x{00F3}\x{00F4}\x{0151}\x{00F6}\x{00F7}\x{0159}\x{016F}\x{00FA}\x{0171}\x{00FC}\x{00FD}\x{0163}\x{02D9}]
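
As a cross-check outside the editor, the same idea (flag every character that code page 1250 cannot represent) can be sketched in Python. This is only a sketch: it assumes Python's built-in `cp1250` codec matches the editor's code page 1250, and `find_unencodable` and `sample` are hypothetical names, not anything from Everything, Notepad++ or EmEditor.

```python
import unicodedata

def find_unencodable(text, encoding="cp1250"):
    """Return (index, char, code point, name) for each character
    that cannot be encoded in the given code page."""
    hits = []
    for i, ch in enumerate(text):
        try:
            ch.encode(encoding)
        except UnicodeEncodeError:
            name = unicodedata.name(ch, "<unnamed>")
            hits.append((i, ch, f"U+{ord(ch):04X}", name))
    return hits

# Hypothetical sample: Polish text with an invisible zero-width space in it.
sample = "Zażółć gęślą\u200b jaźń"
for hit in find_unencodable(sample):
    print(hit)
```

Printing the code point and Unicode name helps when the offending character is invisible, since the position alone does not show what it is.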

Debugger
Posts: 489
Joined: Thu Jan 26, 2017 11:56 am

Re: How to find a strange character?

Post by Debugger » Sat Mar 16, 2019 7:03 am

It finds nothing.
I tested it in Notepad++ and EmEditor.

Image

Image

tuska
Posts: 152
Joined: Thu Jul 13, 2017 9:14 am

Re: How to find a strange character?

Post by tuska » Sat Mar 16, 2019 10:50 am

Debugger wrote:
Sat Mar 16, 2019 7:03 am
It finds nothing.
I tested in ... EmEditor.
Any character with PCRE regex not in code page 1250.png

Debugger
Posts: 489
Joined: Thu Jan 26, 2017 11:56 am

Re: How to find a strange character?

Post by Debugger » Sat Mar 16, 2019 11:18 am

Your screenshot shows a differently configured EmEditor interface, or a version other than the latest EmEditor (Pro version).

A strange symbol can also be something that cannot be seen.

I have no idea why I cannot save the text with the default encoding, or which symbol, Unicode or special character makes it impossible.
If you save it without UTF-8, for example, you get the name:

??
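
A `??` like this is what a lossy conversion typically leaves behind: each character the target code page cannot represent is replaced by a question mark. A minimal Python sketch of that effect follows, assuming the program substitutes rather than refuses to save. The two invisible characters used here (U+200B and U+2060) are only examples, not known to be the ones in the file.

```python
# Hypothetical name: a Polish word followed by two invisible characters.
name = "żółw\u200b\u2060"

# errors="replace" substitutes "?" for anything cp1250 cannot encode.
lossy = name.encode("cp1250", errors="replace").decode("cp1250")
print(lossy)
```

The Polish letters survive the round trip; only the characters outside code page 1250 turn into `?`.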

tuska
Posts: 152
Joined: Thu Jul 13, 2017 9:14 am

Re: How to find a strange character?

Post by tuska » Sat Mar 16, 2019 11:32 am

Debugger wrote: Your screenshot shows a differently configured EmEditor interface, or a version other than the latest EmEditor (Pro version).
That's just the find toolbar (menu "View" - "Toolbars"...) - I use EmEditor Pro Version 18.6.91 x64.

If I want to save a file with special characters (as shown in the picture) as a .txt file,
then "Save as Unicode (UTF-16LE with signature)" is automatically suggested to me...
Saving in this format lets me open the text file without any problems and
save it later without any further prompt.

Debugger
Posts: 489
Joined: Thu Jan 26, 2017 11:56 am

Re: How to find a strange character?

Post by Debugger » Sat Mar 16, 2019 11:54 am

I understand. Filter Toolbar.
But it still does not detect the strange character.
I want to save as 1250, for the Polish text. I do not need any UTF-16LE.
I also tried online tools for detecting strange characters, but they cannot detect it either.

This detects the Polish text, but does not detect strange characters:
[a-żA-Ż ?,:]
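
One possible reason this class is a poor test: a range such as `a-ż` is usually taken by code point (U+0061 to U+017C), so it covers far more than the Polish alphabet, including characters that code page 1250 does not contain. A small Python sketch of that follows. It assumes the editor's regex engine treats ranges the way Python's `re` does, and `POLISH_CLASS` and `in_cp1250` are hypothetical names.

```python
import re

# The character class from the post, unchanged.
POLISH_CLASS = re.compile(r"[a-żA-Ż ?,:]")

def in_cp1250(ch):
    """True if the character can be encoded in code page 1250."""
    try:
        ch.encode("cp1250")
        return True
    except UnicodeEncodeError:
        return False

# ą is Polish; ñ and ¼ fall inside the range but are not in cp1250;
# U+200B (zero-width space) lies outside the range entirely.
for ch in ["ą", "ñ", "\u00bc", "\u200b"]:
    matched = bool(POLISH_CLASS.fullmatch(ch))
    print(f"U+{ord(ch):04X}", "matches class:", matched, "in cp1250:", in_cp1250(ch))
```

So a character can match this class and still be impossible to save in 1250.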

tuska
Posts: 152
Joined: Thu Jul 13, 2017 9:14 am

Re: How to find a strange character?

Post by tuska » Sat Mar 16, 2019 12:49 pm

Debugger wrote:I understand. Filter Toolbar
No, I meant the "Find Toolbar"! -> Menu "View" - "Toolbars" - "Find Toolbar"

Did you paste the code into the "Find" field of the "Find Toolbar", and then
- click the "Use Regular Expressions" button in the "Find Toolbar", and
- click "Find Next" in the "Find Toolbar"?
I then get the special characters marked as shown.

Debugger
Posts: 489
Joined: Thu Jan 26, 2017 11:56 am

Re: How to find a strange character?

Post by Debugger » Sat Mar 16, 2019 1:14 pm

But my text contains none of the special characters you mentioned.

I have checked thousands of different Unicode characters, and I still cannot detect a strange or illegal symbol.

[^\x00-\x7F]+
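
Note that `[^\x00-\x7F]+` matches every non-ASCII run, so in Polish text it also hits ordinary letters such as ż or ł. A hedged Python sketch of narrowing those matches down to characters that code page 1250 cannot encode follows. `outside_cp1250` and `sample` are hypothetical, and the sample text is made up for illustration.

```python
import re

# The pattern from the post: one or more non-ASCII characters.
NON_ASCII = re.compile(r"[^\x00-\x7F]+")

def outside_cp1250(text):
    """Narrow non-ASCII runs down to characters cp1250 cannot encode."""
    found = []
    for match in NON_ASCII.finditer(text):
        for offset, ch in enumerate(match.group()):
            try:
                ch.encode("cp1250")
            except UnicodeEncodeError:
                found.append((match.start() + offset, f"U+{ord(ch):04X}"))
    return found

# Made-up sample: Polish letters, a no-break space (in cp1250)
# and an invisible word joiner U+2060 (not in cp1250).
sample = "żółw\u00a0i\u2060 łoś"
print([m.group() for m in NON_ASCII.finditer(sample)])
print(outside_cp1250(sample))
```

The first line of output lists every non-ASCII run, Polish letters included; the second keeps only the position and code point of what would block saving as 1250.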
