report a bug regarding Unicode character preservation when copying filenames/paths to the clipboard in Everything 1.5

Debugger
Posts: 719
Joined: Thu Jan 26, 2017 11:56 am

Post by Debugger »

I would like to report a bug regarding Unicode character preservation when copying filenames/paths to the clipboard in Everything 1.5.

Bug Description:
When using the "Copy Name" or "Copy Path" function on a file whose name contains full-width CJK brackets （ \x{ff08} and ） \x{ff09}, the text placed on the clipboard does not preserve these characters. When I paste it into a strict Unicode text editor (like EmEditor), the full-width brackets arrive as standard half-width ASCII brackets ( \x{0028} and ) \x{0029}.

This creates severe issues when comparing lists, as the physical file on the disk retains the true full-width brackets, but the copied text from Everything contains altered characters.

Steps to reproduce:
1. Create a file named:

xxx（xx
2. Search for it in Everything 1.5a.
3. Right-click the file and select "Copy Name".
4. Paste it into a Unicode-aware editor (EmEditor / VS Code) and check the binary/hex code of the bracket. It changes from \x{ff08} to \x{0028}.
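
For step 4, a short script can stand in for the editor's hex view. The following is not part of the original report: it is a minimal Python sketch, with the hypothetical helper names dump_code_points and first_difference, that lists the code points of a name and reports where two copies of it diverge. The two sample strings are assumptions built from the file name in step 1.

```python
import unicodedata


def dump_code_points(text):
    """Return one 'U+XXXX : NAME' line per character in text."""
    lines = []
    for ch in text:
        name = unicodedata.name(ch, "<unnamed>")
        lines.append(f"U+{ord(ch):04X} : {name}")
    return lines


def first_difference(a, b):
    """Return the index of the first differing character, or None if equal."""
    for i, (x, y) in enumerate(zip(a, b)):
        if x != y:
            return i
    # No mismatch in the shared prefix: the strings differ only if lengths differ.
    return None if len(a) == len(b) else min(len(a), len(b))


on_disk = "xxx\uff08xx"  # assumed name on disk, with FULLWIDTH LEFT PARENTHESIS
pasted = "xxx(xx"        # assumed pasted text, with ASCII LEFT PARENTHESIS

for line in dump_code_points(on_disk):
    print(line)
print("first difference at index:", first_difference(on_disk, pasted))
```

Running the same dump on the text actually pasted from the clipboard shows whether the bracket arrived as U+FF08 or U+0028, independent of how any editor renders it.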

It seems Everything might be pushing ANSI or poorly normalized text to the Windows Clipboard instead of preserving the exact raw UTF-16/Unicode sequence.

My Environment:
- Everything version: 1.5a
- OS: Windows 10/11 x64

Thank you for looking into this!



Example Filename:

U+0033 : DIGIT THREE
U+0033 : DIGIT THREE
U+0037 : DIGIT SEVEN
U+0039 : DIGIT NINE
U+0031 : DIGIT ONE
U+0020 : SPACE [SP]
U+002D : HYPHEN-MINUS {hyphen, dash; minus sign}
U+0020 : SPACE [SP]
U+0050 : LATIN CAPITAL LETTER P
U+0041 : LATIN CAPITAL LETTER A
U+004D : LATIN CAPITAL LETTER M
U+0061 : LATIN SMALL LETTER A
U+006A : LATIN SMALL LETTER J
U+006F : LATIN SMALL LETTER O
U+0072 : LATIN SMALL LETTER R
U+0020 : SPACE [SP]
U+002D : HYPHEN-MINUS {hyphen, dash; minus sign}
U+0020 : SPACE [SP]
U+522B : CJK UNIFIED IDEOGRAPH-522B
U+56DE : CJK UNIFIED IDEOGRAPH-56DE
U+5934 : CJK UNIFIED IDEOGRAPH-5934
U+0028 : LEFT PARENTHESIS
U+0044 : LATIN CAPITAL LETTER D
U+006A : LATIN SMALL LETTER J
U+963F : CJK UNIFIED IDEOGRAPH-963F
U+6587 : CJK UNIFIED IDEOGRAPH-6587
U+0020 : SPACE [SP]
U+0045 : LATIN CAPITAL LETTER E
U+006C : LATIN SMALL LETTER L
U+0065 : LATIN SMALL LETTER E
U+0063 : LATIN SMALL LETTER C
U+0074 : LATIN SMALL LETTER T
U+0072 : LATIN SMALL LETTER R
U+006F : LATIN SMALL LETTER O
U+0020 : SPACE [SP]
U+0052 : LATIN CAPITAL LETTER R
U+0065 : LATIN SMALL LETTER E
U+006D : LATIN SMALL LETTER M
U+0069 : LATIN SMALL LETTER I
U+0078 : LATIN SMALL LETTER X
U+0029 : RIGHT PARENTHESIS
U+005B : LEFT SQUARE BRACKET
U+0077 : LATIN SMALL LETTER W
U+0077 : LATIN SMALL LETTER W
U+0077 : LATIN SMALL LETTER W
U+002E : FULL STOP {period, dot, decimal point}
U+0064 : LATIN SMALL LETTER D
U+006A : LATIN SMALL LETTER J
U+0039 : DIGIT NINE
U+0033 : DIGIT THREE
U+002E : FULL STOP {period, dot, decimal point}
U+0063 : LATIN SMALL LETTER C
U+006F : LATIN SMALL LETTER O
U+006D : LATIN SMALL LETTER M
U+005D : RIGHT SQUARE BRACKET
U+002E : FULL STOP {period, dot, decimal point}
U+006D : LATIN SMALL LETTER M
U+0070 : LATIN SMALL LETTER P
U+0033 : DIGIT THREE
void
Developer
Posts: 19830
Joined: Fri Oct 16, 2009 11:31 pm

Re: report a bug regarding Unicode character preservation when copying filenames/paths to the clipboard in Everything 1.

Post by void »

Thank you for the bug report, Debugger,

I am unable to reproduce the issue here.

Using a clipboard viewer, I see the following after right-clicking xxx（xx and clicking Copy Name:

CF_UNICODETEXT

00000000  78 00 78 00  78 00 08 FF  x.x.x...
00000008  78 00 78 00  00 00        x.x...

CF_LOCALE

00000000  09 0C 00 00               ....

CF_TEXT

00000000  78 78 78 28  78 78 00     xxx(xx.

CF_OEMTEXT

00000000  78 78 78 28  78 78 00     xxx(xx.

The correct Unicode text is stored on the clipboard.
The clipboard data is identical to the copied text from Notepad or from the inline renaming.
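
To make the dumps above easier to read, here is a minimal Python sketch (not from the original post) that decodes the same bytes. It assumes CF_UNICODETEXT is NUL-terminated UTF-16LE and that CF_TEXT uses code page 1252; the CF_TEXT bytes shown are pure ASCII, so any ASCII-compatible code page would decode them identically.

```python
# Bytes copied from the clipboard viewer dumps in this post.
cf_unicodetext = bytes.fromhex("78 00 78 00 78 00 08 FF 78 00 78 00 00 00")
cf_text = bytes.fromhex("78 78 78 28 78 78 00")
cf_locale = bytes.fromhex("09 0C 00 00")

# Decode and drop the trailing NUL terminator.
unicode_name = cf_unicodetext.decode("utf-16-le").rstrip("\x00")
ansi_name = cf_text.decode("cp1252").rstrip("\x00")  # code page is an assumption

# CF_LOCALE holds a little-endian locale identifier.
locale_id = int.from_bytes(cf_locale, "little")

print("CF_UNICODETEXT:", [f"U+{ord(c):04X}" for c in unicode_name])
print("CF_TEXT:       ", [f"U+{ord(c):04X}" for c in ansi_name])
print(f"CF_LOCALE:      0x{locale_id:04X}")
```

The fourth character decodes to U+FF08 in the Unicode format and to U+0028 only in the 8-bit formats, so an application that reads CF_TEXT instead of CF_UNICODETEXT would see the half-width bracket.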

Can you reproduce the issue by copying the xxx（xx text from Notepad, or by selecting xxx（xx, pressing F2, and copying the name from the inline rename?

Does Win + V show the correct copied text?

Have you enabled set_clipboard_data_text under Tools -> Options -> Advanced? Enabling it sets both Unicode and ANSI text, which might be causing trouble.
("use default" == "disabled" on Windows XP and later)
therube
Posts: 5711
Joined: Thu Sep 03, 2009 6:48 pm

Re: report a bug regarding Unicode character preservation when copying filenames/paths to the clipboard in Everything 1.

Post by therube »

Not seeing that either.
If I copy from your post and paste it into Notepad3, it pastes correctly.
If I create a file on disk with that name, find it in Everything, copy the name, and paste it into Notepad3, it pastes correctly.

I don't really understand the clipboard side of this, but (as mentioned above)
OEM TEXT does show x28, which is an actual '(' (paren).

Oh, and the encoding used by your editor will make a difference.
Below, I'm using UTF-8. If I change that to ANSI (CP-1252), then it shows as x0028.
[Attachment: Everything - xFF08.png]
[Attachment: Everything - xFF08 - ansi.png]
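
The encoding point can be checked without an editor. The sketch below is an illustration only, not a description of what EmEditor, Notepad3 or Windows actually does: it shows that U+FF08 has no representation in CP-1252, and uses Unicode NFKC normalization as a stand-in for a lossy conversion, since NFKC happens to fold the full-width bracket to U+0028. The helper name fits_in is hypothetical.

```python
import unicodedata

name = "xxx\uff08xx"  # file name with FULLWIDTH LEFT PARENTHESIS


def fits_in(text, encoding):
    """Return True if every character of text can be encoded in encoding."""
    try:
        text.encode(encoding)
        return True
    except UnicodeEncodeError:
        return False


# NFKC replaces compatibility characters such as U+FF08 with their
# plain equivalents; used here only to imitate a lossy conversion.
folded = unicodedata.normalize("NFKC", name)

print("fits in UTF-8:  ", fits_in(name, "utf-8"))
print("fits in CP-1252:", fits_in(name, "cp1252"))
print("after folding:  ", [f"U+{ord(c):04X}" for c in folded])
print("folded fits in CP-1252:", fits_in(folded, "cp1252"))
```

If the editor is set to an 8-bit encoding, something has to replace the bracket before the text can be stored, which would match the x0028 seen in the second screenshot.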