I _Really_ Don't Know

A low-frequency blog by Rob Styles

What's in a name?

RePEc, Research Papers in Economics, have been doing some interesting work on allowing the author community to self-specify variant forms of their name.

This is interesting, the example they give is of John Maynard Keynes:

For John Maynard Keynes (who is not registered), such name variations could be:

John Maynard Keynes
John M. Keynes
John Keynes
J. M. Keynes
J. Keynes
Keynes, John Maynard
Keynes, John M.
Keynes, John
Keynes, J. M.
Keynes, J.

Unfortunately Keynes has not registered these himself, I'd love to see some examples from the 16,000 authors who have registered.

This data comes from published papers as well as monograph data, so the variance is a lot higher, I suspect, than occurs solely within libraries and the absence of years of birth (and death) makes it highly ambiguous. RePEc have published a list of author name homonyms. Even within published papers on economics this is a reasonably problematic list.

I've been working on very similar issues with library data, trying to analyze the meaning in marc data.

One of the key things for me was developing techniques for dealing with ambiguity, allowing outside systems with limited data to receive the most specific answer possible, while giving less specific answers for less specific matches.

Comments

Rhys

I have more interest than most in the topic of names, being as I usually use my middle name as you know. I'd be quite intrigued to see how this works, especially if it would rein in some of the confusion that variant forms of my name cause!