Skip to content

SURFLOU/ChooseLanguage

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

2 Commits
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

ChooseLanguage

It's app which calculates given your preferences which foreign language should you learn next.

How to initialize an app:

connection = connect_to_db("localhost", "root", "", "language")
create_tables(connection)
df = normalize_df(language_list, connection)
df = set_wages(df, wages_list)

Output of dataframe given language_list for instance as Polish and wages_list as [1,2,3,4,5]:

      Language  Lexical_similarity  Phoneme_similarity  Family_of_language  \
0   Portuguese                 0.0            1.625000                 0.0   
1      English                 0.0            1.437500                 0.0   
2      Russian                 1.0            0.000000                 2.0   
3      Spanish                 0.0            1.393939                 0.0   
4    Norwegian                 0.0            1.437500                 0.0   
5       German                 0.0            1.437500                 0.0   
6        Dutch                 0.0            0.821429                 0.0   
7      Italian                 0.0            1.121951                 0.0   
8       French                 0.0            1.150000                 0.0   
9      Swedish                 0.0            1.437500                 0.0   
10      Danish                 0.0            1.437500                 0.0   

    Number_of_hours  Total_speakers  Amount_of_job_offers  Total_sum  
0               3.0        0.812263              0.000000   5.437263  
1               3.0        4.000000              5.000000  13.437500  
2               0.0        0.897650              0.115596   4.013246  
3               3.0        1.877567              0.255561   6.527067  
4               3.0        0.000000              0.241190   4.678690  
5               2.1        0.450289              2.418145   6.405935  
6               3.0        0.063065              0.269308   4.153801  
7               3.0        0.222152              0.270557   4.614660  
8               3.0        0.974288              0.521120   5.645408  
9               3.0        0.026635              0.083729   4.547864  
10              3.0        0.002484              0.254936   4.694921  

It calculates such things as:

  • Number of job offers: (based on website pracuj.pl) Number of job offers
  • Total number of native speakers: Number of native speakers
  • Rate of phonemes that are the same in every language:

  • Each language has it own phonemes in International Phonetical Alphabet. Each of this phonemes can be written as hexadecimal code, for instance:
    german_phonemes = [
        "\u0061", "\u0062", "\u0063", "\u0064", "\u0065", "\u0066", "\u0067", "\u0068", "\u0069", "\u006a", "\u006b", "\u006c",
        "\u006d", "\u006e", "\u006f", "\u0070", "\u0071", "\u0072", "\u0073", "\u0074", "\u0075", "\u0076", "\u0077", "\u0078",
        "\u0079", "\u007a", "\u00e4", "\u00c4", "\u00f6", "\u00d6", "\u00fc", "\u00dc"
    ]

    I calculate the ratio how many phonemes differ from one language with another, the less the amount is the higher score is.

  • Lexical similarity (based on this study: Study
  • Lexical similarity is a measure of the degree to which the word sets of two given languages are similar.

    Lexical similarity
  • Number of hours needed to learn a language (based on FSI) Number of hours needed
  • Type of family

About

No description, website, or topics provided.

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages