Hi, my name is Matt and I'm Chinese.
I find this software very useful. But when I type a random word into the search bar, the sorting method is quite messed up.
The files and folders with Chinese names were not sorted alphabetically(Pinyin), instead, they use an outdated method called Stroke order.
No one uses this kind of ancient technique, except someone with zero knowledge of Pinyin.
Some might say you can change the sorting method in Control Panal on Windows. I did, and nothing happens.
So I was wondering if you guys fix it, please!!! Thank you!!!
https://en.wikipedia.org/wiki/Stroke_order
https://en.wikipedia.org/wiki/Pinyin
[Chinese Sorting] Sort by name is messed up
-
775405984
- Posts: 41
- Joined: Tue May 30, 2017 5:32 pm
-
775405984
- Posts: 41
- Joined: Tue May 30, 2017 5:32 pm
Re: [Chinese Sorting] Sort by name is messed up
were you ever gonna fix this ever? Come on, please.
-
NotNull
- Posts: 5961
- Joined: Wed May 24, 2017 9:22 pm
Re: [Chinese Sorting] Sort by name is messed up
It is number 227 on the to do list775405984 wrote:were you ever gonna fix this ever? Come on, please.
Cut @void some slack ....
-
Debugger
- Posts: 719
- Joined: Thu Jan 26, 2017 11:56 am
Re: [Chinese Sorting] Sort by name is messed up
NotNull - such a long list will be realized but in the next century 
-
Debugger
- Posts: 719
- Joined: Thu Jan 26, 2017 11:56 am
Re: [Chinese Sorting] Sort by name is messed up
I am most annoyed with the need to introduce the preceding expressions, it is a waste of time for me and not everyone remembers all these.
-
void
- Developer
- Posts: 19899
- Joined: Fri Oct 16, 2009 11:31 pm
Re: [Chinese Sorting] Sort by name is messed up
Currently, Everything sorts filenames by unicode code points, which is completely wrong, but fast!
I have added to my TODO list to support Unicode Collation Algorithm (UCA). Hopefully this will be available in the next release of Everything.
While this is not pinyin, it might be 'good enough'.
Implementing Pinyin at this stage will be unfeasible. The sorting rules are too complex, I can't use third party sorting or the Windows API to sort as these could change at any time and it is critical the Everything database is sorted in a specific way.
I also have concerns about UCA as storing these collation lookup tables will require quite a bit of data.. (128K+)
There is also a small performance hit with adding collation lookup tables.
https://unicode.org/faq/collation.html
I have added to my TODO list to support Unicode Collation Algorithm (UCA). Hopefully this will be available in the next release of Everything.
While this is not pinyin, it might be 'good enough'.
Implementing Pinyin at this stage will be unfeasible. The sorting rules are too complex, I can't use third party sorting or the Windows API to sort as these could change at any time and it is critical the Everything database is sorted in a specific way.
I also have concerns about UCA as storing these collation lookup tables will require quite a bit of data.. (128K+)
There is also a small performance hit with adding collation lookup tables.
https://unicode.org/faq/collation.html
-
void
- Developer
- Posts: 19899
- Joined: Fri Oct 16, 2009 11:31 pm
Re: [Chinese Sorting] Sort by name is messed up
The Everything 1.5 alpha adds support for sorting by Unicode weights.
-
775405984
- Posts: 41
- Joined: Tue May 30, 2017 5:32 pm
Re: [Chinese Sorting] Sort by name is messed up
I updated to 1269a, and it changes nothing.void wrote: Sun Jul 11, 2021 11:49 pm The Everything 1.5 alpha adds support for sorting by Unicode weights.
Thanks for trying, but I'm not seeing the improvements.
-
therube
- Posts: 5727
- Joined: Thu Sep 03, 2009 6:48 pm
Re: [Chinese Sorting] Sort by name is messed up
Maybe some hints as to what it is not doing correctly?
And is it (at least) "correct" per the Unicode Collation Algorithm?
And is it (at least) "correct" per the Unicode Collation Algorithm?
-
775405984
- Posts: 41
- Joined: Tue May 30, 2017 5:32 pm
Re: [Chinese Sorting] Sort by name is messed up
This is an example of how messed up sorting by name is.therube wrote: Mon Jul 26, 2021 5:21 pm Maybe some hints as to what it is not doing correctly?
And is it (at least) "correct" per the Unicode Collation Algorithm?
I didn't find anything wrong with UCA though. What I did find is that Unicode doesn't support pinyin.
You can link UCA character to GBK,GBK support pinyin.
I'm not a developer, so I don't know how to do it. I'll post this to a Chinese website to see if anyone have any ideas.
Thank you!
You do not have the required permissions to view the files attached to this post.