Search found 53 matches

by pandunia-guru
2025-01-11 18:44
Forum: Panlexia project
Topic: Mapping ids (Concepticon, WOLD, NELex)
Replies: 13
Views: 1460

Re: Mapping ids (Concepticon, WOLD, NELex)

Concept id mapping is now in data/id_map.tsv.

Today I found out that also Concepticon includes translations, see the files in the mappings folder. :o
by pandunia-guru
2025-01-04 14:33
Forum: Panlexia project
Topic: New concept ids
Replies: 4
Views: 303

Re: New concept ids

seweli wrote: 2025-01-03 20:01 What do we do with the two PLX:that.pro in eng-definition.tsv ?
Oops! Let's change the other to PLX:which.pro.
by pandunia-guru
2025-01-02 07:21
Forum: Panlexia project
Topic: New concept ids
Replies: 4
Views: 303

Re: New concept ids


Which one "house" concept will get the number 1 and why?


From Panlexia's point of view the number is opaque i.e. we don't know and we don't care! :D But if you really want to know, the number comes from Wordnet. Let's use the Python terminal to find out. First, let's list all Wordnet synsets ...
by pandunia-guru
2024-12-28 17:51
Forum: Panlexia project
Topic: New concept ids
Replies: 4
Views: 303

New concept ids

Creating concept ids by semantic categorization turned out to be too big work and also in some sense a futile goal. Semantic and thematic categorization of concepts is interesting in itself, but it's probably better to tackle it as a separate work. We need the concept ids only to link definitions ...
by pandunia-guru
2024-12-25 21:26
Forum: english tok
Topic: Word missing in dictionary; but in a lesson?
Replies: 1
Views: 285

Re: Word missing in dictionary; but in a lesson?

sal! Nice to have you here in this forum. :)

At the moment it is best if you tell me.

The dictionary is moving to Panlexia . If you are brave, you can also create a pull request there. In that case you should add the word in pandunia.tsv with a concept id from master.tsv . (If a suitable concept ...
by pandunia-guru
2024-12-13 13:23
Forum: Panlexia project
Topic: Open Multilingual Wordnet (OMW)
Replies: 1
Views: 373

Re: Open Multilingual Wordnet (OMW)

I added WordNet offset and PoS columns to the master.tsv in this change . A Concepticon source file (Borin-2015-1532.tsv) offered 1373 ready-made ids, but the rest we have to fill in by hand. This id gives us the access to words in many languages via OMW.

In Python, we can use the Natural Language ...
by pandunia-guru
2024-11-27 07:49
Forum: Panlexia project
Topic: Not much of time
Replies: 1
Views: 284

Re: Not much of time

Thanks for your help! You have given good feedback and found good material (like Concepticon!) for the project.

My original plan was to finalize the concept ids by the end of this year, but maybe it won't happen because December is busy family time. 😏

Good luck with your other projects!
by pandunia-guru
2024-11-18 18:25
Forum: Panlexia project
Topic: octosql
Replies: 1
Views: 303

Re: octosql

Panlexia is perfect material for database software! Good luck and feel free to upload your programs into our repository also for others to use.
by pandunia-guru
2024-11-14 06:04
Forum: Panlexia project
Topic: Generating bilingual dictionaries
Replies: 5
Views: 553

Re: Generating bilingual dictionaries

danke! :D

By the way, your screenshot shows bad alphabetical sorting. Upper-case "I" is before lower-case "b"! So I had to add a little code for case-insensitive sorting.

Code: Select all

# Sort the words in case-insensitive way.
sorted_dict = sorted(dict, key=lambda s: s[0].casefold())
by pandunia-guru
2024-11-12 15:42
Forum: Panlexia project
Topic: Synonyms
Replies: 1
Views: 292

Re: Synonyms

As the first step we should fill in the style field some information about synonyms. Usually synonyms are used in different situations.

First we need to define a list of styles, like dialectal, slang, pejorative, technical, humoristic, etc. Do you know a good list?