Welcome to the Invelos forums. Please read the forum rules before posting.

Read access to our public forums is open to everyone. To post messages, a free registration is required.

If you have an Invelos account, sign in to post.

    Invelos Forums->DVD Profiler: Contribution Discussion Page: 1 2 3 4  Previous   Next
What about a filter on accented letters...?
Author Message
DVD Profiler Unlimited RegistrantWhite Pongo, Jr.
No, I iz no Cheshire Cat!
Registered: August 22, 2007
Reputation: High Rating
Posts: 1,807
Posted:
PM this userDirect link to this postReply with quote
...in Common Name contributions?
-- Enry
DVD Profiler Unlimited RegistrantStar ContributorWinston Smith
Don't be discommodious
Registered: March 13, 2007
United States Posts: 21,610
Posted:
PM this userEmail this userView this user's DVD collectionDirect link to this postReply with quote
Not a good idea, Enry. Francois is not automatically François, Jose is not automatically José. Toi pretend that it is via filter would be to inroduce an element of error into the database.

Skip
ASSUME NOTHING!!!!!!
CBE, MBE, MoA and proud of it.
Outta here

Billy Video
DVD Profiler Unlimited RegistrantStar ContributorDarklyNoon
No Godz, No Masterz
Registered: May 8, 2007
Reputation: Highest Rating
Germany Posts: 1,945
Posted:
PM this userView this user's DVD collectionDirect link to this postReply with quote
I agree with Skip, this can cause  false entries.

cheers
Donnie
www.tvmaze.com
DVD Profiler Unlimited RegistrantWhite Pongo, Jr.
No, I iz no Cheshire Cat!
Registered: August 22, 2007
Reputation: High Rating
Posts: 1,807
Posted:
PM this userDirect link to this postReply with quote
Have a look at all the non-linking variants of Miguel Núñez in the CLT, for instance.

A filter would mean that
Miguel Nunez
Miguel Núñez
and
Miguel Nuñez
automatically link,
while the credits would stay exactly as they are now.

:false entries
IF there were a Nunez and a Núñez (different people), we could distinguish using a BY so that they do not link, but I don't think that would happen very often.
-- Enry
 Last edited: by White Pongo, Jr.
DVD Profiler Unlimited Registrantantolod
Since Dec 02, 2003
Registered: March 13, 2007
Reputation: High Rating
United States Posts: 940
Posted:
PM this userView this user's DVD collectionDirect link to this postReply with quote
If your suggestion is that all common names would be accent-less (is that a word?) then I would be for that. Makes finding the person easier than having to type special characters in the search box.

How a filter could fix Nunez, however is a problem. As near as I can tell, he has never been credited with the variant that the CLT results of January 2 shows to be his most credited form which is the chosen IMDB name. The only way these credits will ever be fixed is for users to correct their profiles. Having an automatic filter to create entries contributing Miguel A. Núñez Jr. to: Miguel A. Nunez, Jr. [Miguel A. Núñez Jr. ] would be wrong, since he really is NOT credited as Miguel A. Núñez Jr. in most if not all of the films currently profiled that way.
Kevin
DVD Profiler Unlimited RegistrantStar ContributorGSyren
Profiling since 2001
Registered: March 14, 2007
Reputation: Highest Rating
Sweden Posts: 4,678
Posted:
PM this userVisit this user's homepageView this user's DVD collectionDirect link to this postReply with quote
Quoting antolod:
Quote:
If your suggestion is that all common names would be accent-less (is that a word?) then I would be for that. Makes finding the person easier than having to type special characters in the search box.

The problem with that is that what's an accent in one language may not be an accent in another language.

Example: The letters å, ä and ö in Swedish are not accented, but separate letters (they come after z). Changing "Björn" into "Bjorn", for example, would be absolutely wrong.
My freeware tools for DVD Profiler users.
Gunnar
DVD Profiler Unlimited RegistrantWhite Pongo, Jr.
No, I iz no Cheshire Cat!
Registered: August 22, 2007
Reputation: High Rating
Posts: 1,807
Posted:
PM this userDirect link to this postReply with quote
Quoting antolod:
Quote:
If your suggestion is that all common names would be accent-less


Yep!

Quote:

(is that a word?) then I would be for that. Makes finding the person easier than having to type special characters in the search box.


One more reason. 

Quote:


How a filter could fix Nunez, however is a problem. As near as I can tell, he has never been credited with the variant that the CLT results of January 2 shows to be his most credited form which is the chosen IMDB name. The only way these credits will ever be fixed is for users to correct their profiles. Having an automatic filter to create entries contributing Miguel A. Núñez Jr. to: Miguel A. Nunez, Jr. [Miguel A. Núñez Jr. ] would be wrong, since he really is NOT credited as Miguel A. Núñez Jr. in most if not all of the films currently profiled that way.


If the credited name is wrong, let's correct the credits. But the main database is choc full with such incorrect IMDb entries. Using a filter for the Common Name would at least link all them and make them more readily searchable, like you said.
-- Enry
DVD Profiler Unlimited RegistrantStar ContributorRHo
Registered: March 13, 2007
Posts: 2,759
Posted:
PM this userDirect link to this postReply with quote
Quoting GSyren:
Quote:
The problem with that is that what's an accent in one language may not be an accent in another language.

Example: The letters å, ä and ö in Swedish are not accented, but separate letters (they come after z). Changing "Björn" into "Bjorn", for example, would be absolutely wrong.

Another example: The umlauts ä, ö, ü in German are not accents as well. If they have to be converted to ASCII, then the correct conversion would be ae, oe, ue.
DVD Profiler Unlimited RegistrantWhite Pongo, Jr.
No, I iz no Cheshire Cat!
Registered: August 22, 2007
Reputation: High Rating
Posts: 1,807
Posted:
PM this userDirect link to this postReply with quote
Quoting GSyren:
Quote:
Quoting antolod:
Quote:
If your suggestion is that all common names would be accent-less (is that a word?) then I would be for that. Makes finding the person easier than having to type special characters in the search box.

The problem with that is that what's an accent in one language may not be an accent in another language.

Example: The letters å, ä and ö in Swedish are not accented, but separate letters (they come after z). Changing "Björn" into "Bjorn", for example, would be absolutely wrong.


I think the main purpose of the Common Name is not to be absolutely "correct" (the Credited As can be used to store the correct credit exactly as it is) but more importantly to link and make readily searchable all the different variants in the database.

For instance, the  actor who played "Olaf" in Titanic, can be found in DVDP profiles for "Titanic" as
"Bjorn Olsen" 
"Björn Olsen"
and
"Bjørn Olsen".

The three variants don't even link, unless the Common Name is found and entered every time a contribution is made.
-- Enry
DVD Profiler Unlimited RegistrantWhite Pongo, Jr.
No, I iz no Cheshire Cat!
Registered: August 22, 2007
Reputation: High Rating
Posts: 1,807
Posted:
PM this userDirect link to this postReply with quote
Quoting RHo:
Quote:
Another example: The umlauts ä, ö, ü in German are not accents as well. If they have to be converted to ASCII, then the correct conversion would be ae, oe, ue.


Yep, but I wouldn't expect the filter to convert "correctly" all the special letters in all languages.
I would just expect the  filter to link and make more readily searchable a number of variants in the database.

However, maybe there are some letters that are easier to deal with, because they never translate to two letters. For instance,  accented vowels like "è", "é" and so on.
-- Enry
DVD Profiler Unlimited RegistrantStar ContributorRHo
Registered: March 13, 2007
Posts: 2,759
Posted:
PM this userDirect link to this postReply with quote
Quoting EnryWiki:
Quote:
Yep, but I wouldn't expect the filter to convert "correctly" all the special letters in all languages.
I would just expect the  filter to link and make more readily searchable a number of variants in the database.

The name fields are not just linking labels, they have a meaning and should contain correct names.
DVD Profiler Unlimited RegistrantStar ContributorWinston Smith
Don't be discommodious
Registered: March 13, 2007
United States Posts: 21,610
Posted:
PM this userEmail this userView this user's DVD collectionDirect link to this postReply with quote
Wrong again, Rho. But just for yucks, what is your definition of "correct". I just love it when you use these terms but don't bother to explain them, like your reference to equivalents. And BTW, we do use CORRECT names, those would be the names that are CREDITED for each individual film....that is REAL data, not something which likely exists in your imagination.

Ken has stated numerous times that "correct" name is not what we are after, we are after the MOST COMMONLY CREDITED name. Do you want us to list John Wayne's given name or Judy Garland's.<shrugs>

Skip
ASSUME NOTHING!!!!!!
CBE, MBE, MoA and proud of it.
Outta here

Billy Video
 Last edited: by Winston Smith
DVD Profiler Unlimited RegistrantStar Contributorsurfeur51
Since July 3, 2003
Registered: March 29, 2007
Reputation: Great Rating
France Posts: 4,479
Posted:
PM this userView this user's DVD collectionDirect link to this postReply with quote
Quoting Dr Pavlov:
Quote:
... we do use CORRECT names, those would be the names that are CREDITED for each individual film....that is REAL data, not something which likely exists in your imagination.


1+1=3

That is REAL data. It exists now in Invelos forum's computers, everybody can read it...

Is it correct ?
Images from movies
DVD Profiler Unlimited RegistrantStar ContributorWinston Smith
Don't be discommodious
Registered: March 13, 2007
United States Posts: 21,610
Posted:
PM this userEmail this userView this user's DVD collectionDirect link to this postReply with quote
It is correct in that it is real data that actually appears in relation to any given film, surfeur. I don't care for anyone's imagined or hallucinated data. The Credits are the only "real" data. I know that you believe that you know more than the people who make the films, even though you have no involvement in any of them. You deal only with your local data, so for you, you are free to follow whatever you wish. We gad Guidelines, once upon a time, those Guidelines turnedour database into a mess because it was pure anarchy, every user was fighting to manipulate the Online database to his particular wishes to hell with anyone else, including how the titles were sorted. We had titles which literally ping-ponged back and forth on a weekly basis. You may think such a thing is wonderful, Yves, but its not, so we developed a system for the Online that merely attempts to get everyone on the SAME page, instead of their own. I don't care what you do locally and what you think is correct THERE, the Online is another story.

And as with Rho your endless crusade became tiresome long ago, please drop it, Wre have system, it has worked for years now and will continue to work into the future, but I do not believe that Ken will return us to system of anarchy which you prefer.

Skip
ASSUME NOTHING!!!!!!
CBE, MBE, MoA and proud of it.
Outta here

Billy Video
DVD Profiler Unlimited RegistrantStar ContributorRaymondG
Registered: July 7, 2007
Netherlands Posts: 284
Posted:
PM this userView this user's DVD collectionDirect link to this postReply with quote
Skip is right about the Credits: That's what Invelos prescibes in the rules.
Problem is that the REAL people whom those credtis apply to, often have more then one variant in those credits.
One of the major advantages of a compehensive database for all your films is that you can see for a certain cast or crew member in which titles he was credited (linking). That is something to work for. It may however never compromise the Credited As information.
In any case, you cannot leave out accents from names for Credited As information. The reference to the real person could be subject to such a method but I myself am against that although I certainly see the benefits that it brings (easier searching etc.). The problem is that we cannot cancel out the meaning of a certain accent just because it is easier to deal with.
As Skip said about John Wayne; some people use artistnames but that's an entirely different matter. I assume we use the artist name always in those cases.
My DVD's

Who is General Failure and why is he reading my hard drive?
DVD Profiler Unlimited RegistrantStar Contributorsurfeur51
Since July 3, 2003
Registered: March 29, 2007
Reputation: Great Rating
France Posts: 4,479
Posted:
PM this userView this user's DVD collectionDirect link to this postReply with quote
Quoting Dr Pavlov:
Quote:
It is correct in that it is real data that actually appears in relation to any given film, surfeur. I don't care for anyone's imagined or hallucinated data...


Credits are correct as long as somebody doesn't make a typo. Typos exist (I'have even seen some in those forums...), and if you find Jean Paul Belmondo in end credits, it's just a typo. I'm not hallucinated when I affirm that the correct name is Jean-Paul Belmondo, and can cite thousands pages that confirm that.
Images from movies
    Invelos Forums->DVD Profiler: Contribution Discussion Page: 1 2 3 4  Previous   Next