Commit
Fake names noise function

Implementation of fake names noise function
- *Category*: Feature
- *JIRA issue*: [MIC-3878](https://jira.ihme.washington.edu/browse/MIC-3878)
- Adds implementation of fake names noise functions and unit tests
- Adds fake names module to data directory for raw data

Testing
All tests pass.
Showing 4 changed files with 311 additions and 11 deletions.
Original file line number | Diff line number | Diff line change |
---|---|---|
@@ -0,0 +1,188 @@ | ||
""" | ||
This module includes lists of fake first and last names copied from a .pdf | ||
version of a NORC report on PVS. | ||
TODO: Find a link to the NORC report Abie used. | ||
""" | ||
| ||
fake_first_names = [ | ||
"GIRL", | ||
"MOM", | ||
"A", | ||
"GOH", | ||
"MOTHER", | ||
"ADULT", | ||
"GRANDCHILD", | ||
"MR", | ||
"ADULT MALE", | ||
"GRANDDAUGHTER", | ||
"MRS", | ||
"B", | ||
"GRANDSON", | ||
"MS", | ||
"BABY", | ||
"H", | ||
"N", | ||
"BOY", | ||
"HIJA", | ||
"NEPHEW", | ||
"BROTHER", | ||
"HIJO", | ||
"NINO", | ||
"C", | ||
"HOUSE", | ||
"O", | ||
"CHILD", | ||
"HUSBAND", | ||
"OLDEST", | ||
"CHILD F", | ||
"INMATE", | ||
"ONE", | ||
"COH", | ||
"J", | ||
"P", | ||
"D", | ||
"K", | ||
"PERSON", | ||
"DAD", | ||
"KID", | ||
"R", | ||
"DAU", | ||
"L", | ||
"RESIDENT", | ||
"DAUGHTER", | ||
"LADY", | ||
"RESPONDENT", | ||
"DAUGHTER OF", | ||
"LADY IN THE", | ||
"S", | ||
"DOH", | ||
"LADY OF", | ||
"SENOR", | ||
"E", | ||
"LADY OF HOUSE", | ||
"SENORA", | ||
"F", | ||
"LADY OF THE", | ||
"SISTER", | ||
"FATHER", | ||
"LOH", | ||
"SOH", | ||
"FEMALE", | ||
"M", | ||
"SON", | ||
"FEMALE CHILD", | ||
"MALE", | ||
"SON OF", | ||
"FRIEND", | ||
"MALE CHILD", | ||
"T", | ||
"G", | ||
"MAN", | ||
"V", | ||
"GENT", | ||
"MAN IN THE", | ||
"W", | ||
"GENTELMAN", | ||
"MAN OF", | ||
"WIFE", | ||
"GENTLE", | ||
"MAN OF THE", | ||
"WOMAN", | ||
"GENTLEMAN", | ||
"MINOR", | ||
"YOUNGEST", | ||
"GENTLEMAN OF", | ||
"MISS", | ||
"GENTLEMEN", | ||
"MOH", | ||
] | ||
| ||
fake_last_names = [ | ||
"HH", | ||
"OF THE HOUSE", | ||
"A", | ||
"HHM", | ||
"ONE", | ||
"ADULT", | ||
"HOME", | ||
"OWNER", | ||
"ANON", | ||
"HOUSE", | ||
"P", | ||
"ANONYMOUS", | ||
"HOUSEHOLD", | ||
"PARENT", | ||
"APELLIDO", | ||
"HOUSEHOLDER", | ||
"PERSON", | ||
"B", | ||
"HUSBAND", | ||
"R", | ||
"BOY", | ||
"J", | ||
"REF", | ||
"C", | ||
"K", | ||
"REFUSE", | ||
"CASA", | ||
"L", | ||
"RESIDENT", | ||
"CHILD", | ||
"LADY", | ||
"RESP", | ||
"COH", | ||
"LADY OF HOUSE", | ||
"RESPONDANT", | ||
"D", | ||
"LADY OF THE HOUSE", | ||
"RESPONDENT", | ||
"DAUGHTER", | ||
"LAST NAME", | ||
"S", | ||
"DE CASA", | ||
"LOH", | ||
"SOH", | ||
"DE LA CASA", | ||
"M", | ||
"SON", | ||
"DECLINED", | ||
"MALE", | ||
"T", | ||
"DOE", | ||
"MAN", | ||
"THE HOUSE", | ||
"DOH", | ||
"MAN OF THE HOUSE", | ||
"THREE", | ||
"DONT KNOW", | ||
"MOH", | ||
"TWO", | ||
"E", | ||
"N", | ||
"UNK", | ||
"F", | ||
"NA", | ||
"UNKNOWN", | ||
"FEMALE", | ||
"NO", | ||
"W", | ||
"FOUR", | ||
"NO LAST NAME", | ||
"WIFE", | ||
"FRIEND", | ||
"NO NAME", | ||
"X", | ||
"G", | ||
"NONE", | ||
"XXX", | ||
"GIRL", | ||
"O", | ||
"Y", | ||
"GOH", | ||
"OCCUPANT", | ||
"YOUNGER", | ||
"H", | ||
"OF HOUSE", | ||
"H AGE", | ||
"OF THE HOME", | ||
] | ||
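
The noise function itself is not shown in this part of the diff, so the following is only a minimal sketch of how lists like `fake_first_names` might be consumed. The function name `add_fake_name_noise`, its parameters, and the shortened `FAKE_FIRST_NAMES` excerpt are hypothetical and are not taken from the commit; the real implementation may work on pandas columns and use a different randomness source.

```python
import random
from typing import List, Optional, Sequence

# Hypothetical excerpt of the fake_first_names list from the module above.
FAKE_FIRST_NAMES = ["GIRL", "MOM", "A", "GOH", "MOTHER"]


def add_fake_name_noise(
    names: Sequence[Optional[str]],
    fake_names: Sequence[str],
    probability: float,
    seed: Optional[int] = None,
) -> List[Optional[str]]:
    """Replace each non-missing name with a random fake name.

    Each name is replaced independently with the given probability.
    Missing values (None) are left untouched. This is an illustrative
    sketch, not the function added by the commit.
    """
    rng = random.Random(seed)  # local generator so results are reproducible
    noised: List[Optional[str]] = []
    for name in names:
        if name is not None and rng.random() < probability:
            noised.append(rng.choice(fake_names))
        else:
            noised.append(name)
    return noised


if __name__ == "__main__":
    original = ["Maria", "James", None, "Wei"]
    print(add_fake_name_noise(original, FAKE_FIRST_NAMES, probability=0.5, seed=0))
```

Using a locally seeded generator rather than the global `random` state keeps the sketch deterministic in tests without affecting other code.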