Advanced option or tricks to find dupe with partial names

Discussion related to "Everything" 1.5.
Post Reply
luke32
Posts: 6
Joined: Fri Mar 28, 2025 3:59 am

Advanced option or tricks to find dupe with partial names

Post by luke32 »

Hi, Now I have too many folders most of it are duplicates but it have different name sometimes bad names but All have some partial names

example

original folder = (666)-alarma_99_vinyl_perplexer-1999

duplicate 1 = 666 alarma 99 perplexer 1999

duplicate 2 = 666 alarma

duplicate 3 = alarma perplexer 1999

duplicate 4 = 666 perplexer

anyway to list all these as duplicate on roll under each other when I search for dupe: like this ? size could be slightly different also and even files inside could have different names or extra file

any help please
ChrisGreaves
Posts: 821
Joined: Wed Jan 05, 2022 9:29 pm

Re: Advanced option or tricks to find dupe with partial names

Post by ChrisGreaves »

luke32 wrote: Mon Apr 14, 2025 1:59 amHi, Now I have too many folders most of it are duplicates but it have different name sometimes bad names but All have some partial names
Hi Luke. I'd suggest:-
(1) Read this article about Soundex just once to get the feel of it then
(2) Consider running your own tests on a bit of program code that demonstrates Soundex (for example, try all variants of your surname, your full name etc.) then
(3) Consider implementing Soundex in a search with Everything.

Yes, I have an implementation of Soundex in MSWord/VBA. Please PM me if you'd like to download the VBA code.
Cheers, Chris
P.S. Here are the initial tests I ran on your supplied examples of folder names:-

Code: Select all

Debug.Print strSoundex("666 alarma 99 perplexer 1999")
Debug.Print strSoundex("666 alarma")
Debug.Print strSoundex("alarma perplexer 1999")
Debug.Print strSoundex("666 perplexer")

Debug.Print ""
Debug.Print strSoundex("alarma 99 perplexer 1999")
Debug.Print strSoundex("alarma")
Debug.Print strSoundex("alarma perplexer 1999")
Debug.Print strSoundex("perplexer")

6465
6465
a465
6161

a465
a465
a465
p614
ChrisGreaves
Posts: 821
Joined: Wed Jan 05, 2022 9:29 pm

Re: Advanced option or tricks to find dupe with partial names

Post by ChrisGreaves »

ChrisGreaves wrote: Mon Apr 14, 2025 1:53 pm (2) Consider running your own tests on a bit of program code that demonstrates Soundex (for example, try all variants of your surname, your full name etc.)
At this point I should point out that you are not committed to Soundex or to any other well-defined algorithm.
In particular: If you have a good knowledge of your target file-names (suppose that they are all Classical Music symphonies in a variety of keys, optionally with a composer's name; or suppose that they are all movies; or ...) then you might want to devise your own algorithm with a view to eliminating the "noise" type characters.

For example, with the data that you supplied, what happens if we squeeze out decimal digits before Soundex analysis?
For example, with the data that you supplied, what happens if we squeeze out decimal digits before YourAlgorithm analysis?
What results (of matching) do you get if you eliminate vowels?
What results (of matching) do you get if you eliminate duplicate characters in any pair of adjacent characters
What results (of matching) do you get if you eliminate any duplicate characters, that is, if you consider only the unique characters?

There is always the GOAL of your matching: Will this be a one-off solution for your files, only or are their plans to release this for general use by the global population?
Cheers, Chris
luke32
Posts: 6
Joined: Fri Mar 28, 2025 3:59 am

Re: Advanced option or tricks to find dupe with partial names

Post by luke32 »

Hello , I have zero experience with coding , sorry.

The example was just one original folder with about 10 or less duplicates with other names , I have so much folders like this example but with different names for sure

I just want to know if there is anything I can do to show possible duplicate folders ?

Sorry if my request very complicated :?
ChrisGreaves
Posts: 821
Joined: Wed Jan 05, 2022 9:29 pm

Re: Advanced option or tricks to find dupe with partial names

Post by ChrisGreaves »

luke32 wrote: Tue Apr 15, 2025 1:58 amHello , I have zero experience with coding , sorry.
Why feel sorry?
I just want to know if there is anything I can do to show possible duplicate folders ?
Yes. There is something that you can do,
Did you read (and think) about existing methods to extract the kernel of data from mixed names?
Sorry if my request very complicated :?
Why sorry? Your request seems quite simple to me (and perhaps to other members of this forum).

After you had read that Wikipedia page and other material, what did you think?
If your reaction is "Oh! That's too complicated for me" then the answer will be "No!".
But if you think along the lines of "There are already several possibilities at hand, even prepared coding!", then there's a chance you can have a solution, yes?
Cheers, Chris
therube
Posts: 5723
Joined: Thu Sep 03, 2009 6:48 pm

Re: Advanced option or tricks to find dupe with partial names

Post by therube »

666 is common to all.
alarma or perplexer are common to all 666's.

So,
666 <alarma | perplexer>


So, 666 AND alarma, or, 666 AND perplexer.
luke32
Posts: 6
Joined: Fri Mar 28, 2025 3:59 am

Re: Advanced option or tricks to find dupe with partial names

Post by luke32 »

Guess I don't get it , you guys are have more advanced info more than what I can do , I like simple options , so when I reinstall or use different computer I can do it anywhere without moving and copy files , just ticking some boxes would make life much easy I still believe this tool is the best . maybe someday the Dev will add more option for partial search., I hope.
therube
Posts: 5723
Joined: Thu Sep 03, 2009 6:48 pm

Re: Advanced option or tricks to find dupe with partial names

Post by therube »

Everything 1.5 alpha has an "Advanced Search" (form) [that I'm not really familiar with],
but I'm thinking if you used that & set:

Filenames containing...
all these words:
666

any of these words:
alarma perplexer


should do it?
Post Reply