Topic: Some useful dictionaries

Go back to forum.

Topic by
beno

2008-09-23
07:40

Some useful dictionaries

Hi everybody,

I've been trying out your product and lurking on these forums for a couple of weeks and I really like what I'm seeing. I've been using DataCleaner to profile a dataset consisting mainly of people-data, such as names, nationality etc.

For this I started using the provided name dictionaries but quickly I understood that these are only for Danish people?! Anyways, I've collected four new dictionaries that I think could perhaps be of some use by everyone, so I want to know if you want them for your application?

The dictionaries are: List of nationalities, List of countries, US top 1000 male names, US top 1000 girl names
Reply by
kasper

2008-09-23
08:37
Hello beno,

Yes this is really something we could use. I agree we should have more dictionaries included in DataCleaner and this is also something I think we will intensify a lot the next couple of months. If I could get you to attach them as text files to ticket #25 then it would be great!
Reply by
beno

2008-09-23
08:52
I've uploaded the dictionaries that I had. I also included a US states list and took a little time to find the last names for US also. Now I'm off to bed, nightie :)
Reply by
beno

2008-09-23
08:57
For reference, you can find US names here:

http://www.ssa.gov/OACT/babynames/

I just used the most popular names of 2007 but actually I'm thinking if that may have been a bad decision - I mean, it would be best with a population-based number instead of the newborn one.
Reply by
kasper

2008-09-23
10:30
Thank you very much beno!

Regarding the baby names I will try and look for other sources for this information and make a merged dictionary out of them then. Thanks for the contributions.
Reply by
henry.coleman

2008-12-24
12:40
Thank you beno and kasper,

I applied the US naming dictionaries yesterday and it got my job done before Christmas break! Cheers and have a merry Christmas.

You need to be logged in to participate

In order to post your own comments on this topic, you need to be logged in.

Username:

Log in by clicking the login link at the top of the screen

 

Go back to forum.

Username:

Password:

Requested username:

Password:

Real name:

Email address:

Title:

Company:

Country: