
by pandunia-guru
2024-11-07 17:50
Forum: Panlexia project
Topic: Generating bilingual dictionaries
Replies: 5
Views: 5670

Re: Generating bilingual dictionaries

Today I added a shell script for creating all bilingual dictionaries for a language at once. Run it on Linux (native or WSL) by typing, for example:
sh generate_bilingual_dictionaries.sh pandunia
And it will populate the generated/ folder with lots of simple but nice dictionaries. :)
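
For readers not on Linux, here is a rough Python sketch of what a batch run like this could look like. It is not the contents of generate_bilingual_dictionaries.sh. It assumes only the write_bilingual_dict.py command line shown in this topic. The target language list is a hypothetical placeholder, and the function names are invented for illustration.

```python
import subprocess


def build_commands(source, targets, script="src/write_bilingual_dict.py"):
    """Return one command per target language, skipping the source language itself."""
    return [
        ["python3", script, source, target]
        for target in targets
        if target != source
    ]


def generate_all(source, targets, dry_run=True):
    """Print the commands (dry run) or execute them one by one."""
    for cmd in build_commands(source, targets):
        if dry_run:
            print(" ".join(cmd))
        else:
            # check=True stops the batch at the first failing dictionary.
            subprocess.run(cmd, check=True)


if __name__ == "__main__":
    # Hypothetical target list; the real script decides this on its own.
    generate_all("pandunia", ["eng", "ind", "pandunia"])
```

With dry_run=True it only prints the commands, so the list can be checked before anything is written to disk.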
by pandunia-guru
2024-11-05 20:05
Forum: Panlexia project
Topic: Generating bilingual dictionaries
Replies: 5
Views: 5670

Generating bilingual dictionaries

I wrote a Python program called write_bilingual_dict.py for writing bilingual dictionaries. It is run from the command line like this:
python3 src/write_bilingual_dict.py <source language code> <target language code>

A practical example:
python3 src/write_bilingual_dict.py pandunia eng

With this ...
by pandunia-guru
2024-11-04 06:14
Forum: Panlexia project
Topic: Soya plant and soya bean... and both
Replies: 1
Views: 2462

Re: Soya plant and soya bean... and both

This is my solution.

Plant:Glycine max.N An annual plant native to East Asia that is widely cultivated for its fruit. 2027 1095
Ingestion:Glycine max.N The edible seed of the soybean plant which contains a lot of protein. 2028

It's because definitions in ULD are less precise than in Concepticon ...
by pandunia-guru
2024-11-01 09:52
Forum: Panlexia project
Topic: Everyone should use Git and GitHub?
Replies: 9
Views: 9792

Re: Everyone should use Git and GitHub?

You can see a list of the files you have modified by typing "git status".
by pandunia-guru
2024-11-01 09:50
Forum: Panlexia project
Topic: WordNet - A Lexical Database for English
Replies: 5
Views: 3603

Re: WordNet - A Lexical Database for English

In my opinion, the best order is the one where the word-class marker comes last. I like to group, for example, to clean (v.), clean (adj.) and (the act of) cleaning (n.) in the same semantic category. Apparently that wouldn't even be possible in WordNet!

You are right about consumption. What about ingestion ...
by pandunia-guru
2024-11-01 08:42
Forum: Panlexia project
Topic: Open Multilingual Wordnet (OMW)
Replies: 1
Views: 2108

Open Multilingual Wordnet (OMW)

I found another goldmine! Open Multilingual Wordnet is a resource that bundles together wordnets in different languages. It is essentially a collection of files with tab-separated values. Here's a sample from the Indonesian file.


00019613-n ind:def 0 masalah fisik yang nyata
00019613-n ind:lemma ...
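
A minimal sketch of reading such a file in Python, assuming the layout visible in the sample above: a synset id, then a `language:type` field, then the value, with an extra sense-number column before the text on `def` lines. The function name and the lemma value in the usage note are placeholders, not data from OMW.

```python
from collections import defaultdict


def parse_omw_lines(lines):
    """Group lemmas and definitions by synset id from OMW-style tab-separated lines."""
    entries = defaultdict(lambda: {"lemmas": [], "defs": []})
    for line in lines:
        line = line.rstrip("\n")
        # Skip blank lines and comment/header lines.
        if not line or line.startswith("#"):
            continue
        fields = line.split("\t")
        if len(fields) < 3:
            continue
        synset, kind = fields[0], fields[1]
        # kind looks like "ind:lemma" or "ind:def".
        _lang, _, relation = kind.partition(":")
        if relation == "lemma":
            entries[synset]["lemmas"].append(fields[2])
        elif relation == "def" and len(fields) >= 4:
            # fields[2] is the sense number, fields[3] the definition text.
            entries[synset]["defs"].append(fields[3])
    return dict(entries)
```

Called on the definition line from the sample plus a made-up lemma line such as "00019613-n<TAB>ind:lemma<TAB>placeholder", it returns one entry for 00019613-n holding both values.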
by pandunia-guru
2024-11-01 05:24
Forum: Panlexia project
Topic: WordNet - A Lexical Database for English
Replies: 5
Views: 3603

Re: WordNet - A Lexical Database for English

I downloaded WordNet files. The file wn3.1.dict.tar.gz includes the newest database. I looked inside it and found out that the file names reveal the concept categories.

Adjective and adverb categories don't have any useful information.


adj.all
adj.pert
adj.ppl
adv.all


WordNet categorizes ...
by pandunia-guru
2024-10-30 06:37
Forum: english tok
Topic: To speak about the application phpBB forum
Replies: 5
Views: 5914

Re: To speak about the application phpBB forum

It's true that phpBB doesn't store pictures. I think it's because the size of the forum database would multiply if pictures or even videos were allowed. (Reddit is similar. It doesn't allow posting images except as links to external sites.) I host this forum myself and my server doesn't have ...
by pandunia-guru
2024-10-30 06:18
Forum: Panlexia project
Topic: WordNet - A Lexical Database for English
Replies: 5
Views: 3603

Re: WordNet - A Lexical Database for English

WordNet is a great find, but it's hard to say how useful it is for Panlexia. WordNet is a monolingual English database and it doesn't link to any translations, as you said. However, I found a word list in Concepticon, Borin 2015 1532, which "contains an additional mapping to the Princeton Wordnet ...
by pandunia-guru
2024-10-29 16:04
Forum: Panlexia project
Topic: Word lists in many languages
Replies: 1
Views: 2010

Word lists in many languages

We have now finalized over 1500 concept ids. On the side, I have written programs to extract word lists from NELex, ULD and WOLD. Today I tried them out and they work. I uploaded a handful of dictionaries to the Panlexia repository on GitHub.

I also made a word list for Pandunia by myself. It ...