Concept id mapping is now in data/id_map.tsv.
Today I found out that Concepticon also includes translations; see the files in the mappings folder.
Search found 53 matches
- 2025-01-11 18:44
- Forum: Panlexia project
- Topic: Mapping ids (Concepticon, WOLD, NELex)
- Replies: 13
- Views: 1460
- 2025-01-04 14:33
- Forum: Panlexia project
- Topic: New concept ids
- Replies: 4
- Views: 303
- 2025-01-02 07:21
- Forum: Panlexia project
- Topic: New concept ids
- Replies: 4
- Views: 303
Re: New concept ids
Which "house" concept will get the number 1, and why?
From Panlexia's point of view the number is opaque, i.e. we don't know and we don't care! :D But if you really want to know, the number comes from WordNet. Let's use the Python terminal to find out. First, let's list all WordNet synsets ...
- 2024-12-28 17:51
- Forum: Panlexia project
- Topic: New concept ids
- Replies: 4
- Views: 303
New concept ids
Creating concept ids by semantic categorization turned out to be too much work and also, in some sense, a futile goal. Semantic and thematic categorization of concepts is interesting in itself, but it's probably better to tackle it as a separate task. We need the concept ids only to link definitions ...
- 2024-12-25 21:26
- Forum: english tok
- Topic: Word missing in dictionary; but in a lesson?
- Replies: 1
- Views: 285
Re: Word missing in dictionary; but in a lesson?
sal! Nice to have you here in this forum. :)
At the moment it is best if you tell me.
The dictionary is moving to Panlexia. If you are brave, you can also create a pull request there. In that case you should add the word in pandunia.tsv with a concept id from master.tsv. (If a suitable concept ...
- 2024-12-13 13:23
- Forum: Panlexia project
- Topic: Open Multilingual Wordnet (OMW)
- Replies: 1
- Views: 373
Re: Open Multilingual Wordnet (OMW)
I added WordNet offset and PoS columns to master.tsv in this change. A Concepticon source file (Borin-2015-1532.tsv) offered 1373 ready-made ids, but the rest we have to fill in by hand. This id gives us access to words in many languages via OMW.
In Python, we can use the Natural Language ...
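As a rough illustration of how a WordNet offset and PoS pair can serve as a key into OMW, here is a minimal sketch. It assumes the offset is stored as a plain number and the PoS as one of WordNet's one-letter tags; the actual column layout of master.tsv is not shown here, so the sample values and the helper names `omw_key` and `lemmas_for` are hypothetical. The second helper uses NLTK and only works if NLTK and its WordNet and OMW data are installed.

```python
def omw_key(offset, pos):
    """Format a WordNet offset and PoS tag as a key such as '02084071-n'."""
    if pos not in {"n", "v", "a", "s", "r"}:
        raise ValueError(f"unexpected WordNet PoS tag: {pos!r}")
    # Offsets are conventionally written zero-padded to eight digits.
    return f"{int(offset):08d}-{pos}"


def lemmas_for(offset, pos, lang="eng"):
    """Look up lemma names with NLTK (optional; needs the wordnet and omw-1.4 data)."""
    from nltk.corpus import wordnet as wn  # imported lazily: optional dependency
    synset = wn.synset_from_pos_and_offset(pos, int(offset))
    return synset.lemma_names(lang)


# Hypothetical values, as they might be read from the two new columns.
print(omw_key("2084071", "n"))
```

Calling `lemmas_for` with the same offset and PoS and a language code should then return the words for that synset in the chosen language, provided the language is covered by the installed OMW data.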
- 2024-11-27 07:49
- Forum: Panlexia project
- Topic: Not much of time
- Replies: 1
- Views: 284
Re: Not much of time
Thanks for your help! You have given good feedback and found good material (like Concepticon!) for the project.
My original plan was to finalize the concept ids by the end of this year, but maybe it won't happen because December is a busy family time.
Good luck with your other projects!
- 2024-11-18 18:25
- Forum: Panlexia project
- Topic: octosql
- Replies: 1
- Views: 303
Re: octosql
Panlexia is perfect material for database software! Good luck, and feel free to upload your programs to our repository for others to use.
- 2024-11-14 06:04
- Forum: Panlexia project
- Topic: Generating bilingual dictionaries
- Replies: 5
- Views: 553
Re: Generating bilingual dictionaries
danke!
By the way, your screenshot shows bad alphabetical sorting. Upper-case "I" is before lower-case "b"! So I had to add a little code for case-insensitive sorting.
Code:
# Sort the words in a case-insensitive way.
sorted_dict = sorted(dict, key=lambda s: s[0].casefold())
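To see why the key function matters, here is a small self-contained sketch. It assumes the dictionary is a list of (word, translation) pairs, which the snippet above does not confirm, and the sample entries are made up for illustration.

```python
# Made-up sample entries: (headword, translation) pairs.
entries = [("Iceland", "x"), ("banana", "y"), ("apple", "z")]

# Default sorting compares code points, so every upper-case letter
# comes before every lower-case one.
default_order = [word for word, _ in sorted(entries)]

# Sorting on the case-folded headword gives plain alphabetical order.
folded_order = [word for word, _ in sorted(entries, key=lambda e: e[0].casefold())]

print(default_order)  # ['Iceland', 'apple', 'banana']
print(folded_order)   # ['apple', 'banana', 'Iceland']
```

casefold() is a slightly more aggressive variant of lower() that is intended for caseless comparison, which is useful when the word list contains non-English letters.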
- 2024-11-12 15:42
- Forum: Panlexia project
- Topic: Synonyms
- Replies: 1
- Views: 292
Re: Synonyms
As a first step, we should fill in the style field with some information about synonyms. Synonyms are usually used in different situations.
First we need to define a list of styles, like dialectal, slang, pejorative, technical, humorous, etc. Do you know a good list?