Search found 69 matches

by pandunia-guru
2024-11-01 09:52
Forum: Panlexia project
Topic: Everyone should use Git and GitHub?
Replies: 9
Views: 306072

Re: Everyone should use Git and GitHub?

You can see a list of the files you have modified by typing "git status".
by pandunia-guru
2024-11-01 09:50
Forum: Panlexia project
Topic: WordNet - A Lexical Database for English
Replies: 5
Views: 52645

Re: WordNet - A Lexical Database for English

In my opinion the best order is the one where the word-class marker comes last. I like to group, for example, to clean (v.), clean (adj.) and (the act of) cleaning (n.) in the same semantic category. Apparently that wouldn't even be possible in WordNet!

You are right about consumption. What about ingestion ...
by pandunia-guru
2024-11-01 08:42
Forum: Panlexia project
Topic: Open Multilingual Wordnet (OMW)
Replies: 1
Views: 28930

Open Multilingual Wordnet (OMW)

I found another goldmine! Open Multilingual Wordnet is a resource that bundles together wordnets in different languages. It is essentially a collection of files with tab-separated values. Here's a sample from the Indonesian file.


00019613-n ind:def 0 masalah fisik yang nyata
00019613-n ind:lemma ...
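
A minimal sketch of how rows like the sample above could be read, assuming the columns are separated by tabs in the order synset id, "language:type", then the value (with an extra index column before the text on "def" rows). The function name parse_omw_lines and the second sample row (synset 99999999-n with the value PLACEHOLDER) are hypothetical and only there for illustration; they are not taken from the Indonesian file.

```python
from collections import defaultdict


def parse_omw_lines(lines):
    """Group OMW-style rows by synset id.

    Assumed layout (tab-separated): synset, "lang:type", value columns.
    On "def" rows the text is taken from the last column, because an
    index column is assumed to come before it.
    """
    entries = defaultdict(lambda: {"lemma": [], "def": []})
    for raw in lines:
        line = raw.rstrip("\n")
        # Skip blank lines and comment/header lines.
        if not line or line.startswith("#"):
            continue
        fields = line.split("\t")
        if len(fields) < 3:
            continue
        synset, tag = fields[0], fields[1]
        lang, _, kind = tag.partition(":")
        if kind == "def":
            text = fields[-1]
        elif kind == "lemma":
            text = fields[2]
        else:
            continue  # ignore row types this sketch does not handle
        entries[synset][kind].append((lang, text))
    return dict(entries)


sample = [
    "00019613-n\tind:def\t0\tmasalah fisik yang nyata",
    "99999999-n\tind:lemma\tPLACEHOLDER",  # hypothetical row, not real data
]
result = parse_omw_lines(sample)
print(result["00019613-n"]["def"])
```

Grouping by synset id keeps lemmas and definitions of the same concept together, which is probably the shape you want before mapping them to other ids.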
by pandunia-guru
2024-11-01 05:24
Forum: Panlexia project
Topic: WordNet - A Lexical Database for English
Replies: 5
Views: 52645

Re: WordNet - A Lexical Database for English

I downloaded the WordNet files. The file wn3.1.dict.tar.gz contains the newest database. I looked inside it and found that the file names reveal the concept categories.

The adjective and adverb categories don't carry any useful information.


adj.all
adj.pert
adj.ppl
adv.all


WordNet categorizes ...
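
As a rough illustration of reading categories off the file names, here is a small sketch that splits each name at the first dot into a word class and a category. It uses only the four names listed above; the function name group_by_word_class is made up for this example, and the assumption is simply that every name follows the "wordclass.category" pattern.

```python
from collections import defaultdict


def group_by_word_class(file_names):
    """Split names like "adj.pert" into word class and category."""
    groups = defaultdict(list)
    for name in file_names:
        # Everything before the first dot is treated as the word class.
        word_class, _, category = name.partition(".")
        groups[word_class].append(category)
    return dict(groups)


names = ["adj.all", "adj.pert", "adj.ppl", "adv.all"]
groups = group_by_word_class(names)
print(groups)
```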
by pandunia-guru
2024-10-30 06:37
Forum: english tok
Topic: To speak about the application phpBB forum
Replies: 5
Views: 37000

Re: To speak about the application phpBB forum

It's true that phpBB doesn't store pictures. I think it's because the size of the forum database would multiply if pictures or even videos were allowed. (Reddit is similar. It doesn't allow posting images except as links to external sites.) I host this forum myself and my server doesn't have ...
by pandunia-guru
2024-10-30 06:18
Forum: Panlexia project
Topic: WordNet - A Lexical Database for English
Replies: 5
Views: 52645

Re: WordNet - A Lexical Database for English

WordNet is a great find, but it's hard to say how useful it is for Panlexia. WordNet is a monolingual English database and it doesn't link to any translations, as you said. However, I found a word list in Concepticon, Borin 2015 1532, which "contains an additional mapping to the Princeton Wordnet ...
by pandunia-guru
2024-10-29 16:04
Forum: Panlexia project
Topic: Word lists in many languages
Replies: 1
Views: 31530

Word lists in many languages

We have now finalized over 1500 concept IDs. On the side I have written programs to extract word lists from NELex, ULD and WOLD. Today I tried them out and they work. I uploaded a handful of dictionaries to the Panlexia repository on GitHub.

I also made a word list for Pandunia by myself. It ...
by pandunia-guru
2024-10-23 21:07
Forum: esperanto tok
Topic: Article about Panglo in Esperanto
Replies: 1
Views: 29239

Article about Panglo in Esperanto

The magazine Esperanta Finnlando published my article "Internaciaj lingvoj en komparo" (International languages in comparison) in its latest issue, 4b/2024. It is about international languages and compares features of Pandunia 3 (i.e. Panglo!), Esperanto and English.
by pandunia-guru
2024-10-17 17:46
Forum: Panlexia project
Topic: Everyone should use Git and GitHub?
Replies: 9
Views: 306072

Re: Everyone should use Git and GitHub?


commit -a ?
It looks dangerous 😅


It commits only files that are in the Git repository, not any other files. Besides, you can always undo or modify your last commit before you have uploaded/pushed it to GitHub. :)

You can also commit individual files like this:
git commit -m "Describe my ...
by pandunia-guru
2024-10-16 16:29
Forum: english tok
Topic: Spammers have arrived
Replies: 2
Views: 27127

Re: Spammers have arrived

I read that spam bots can easily pass the GD image test. So I changed spammer detection to a Q&A system with one simple question in Pandunia. :twisted: