Questions tagged [natural-language]

Natural language processing (NLP): analyzing and generating text in human languages

Natural language processing (NLP) refers to the analysis of text written in human languages, covering grammatical analysis, understanding, generation, translation, etc.

A spell checker that matches against a pre-defined set of words doesn't qualify as NLP (see the tag instead), but most more advanced tasks do. This tag is for tasks related to text stored in text form; for speech recognition and synthesis, use the tags and instead.

Natural language processing is an advanced subject and most tasks will require some programming. See NLP on Stack Overflow.

10 questions
41
votes
5 answers

Converting a UTF-8 file to ASCII (best-effort)

I have a file in UTF-8 that contains texts in multiple languages. A lot of it are people's names. I need to convert it to ASCII and I need the result to look as decent as possible. There are many ways how to approach converting from a wider encoding…
user7610
  • 1,878
  • 2
  • 18
  • 22
7
votes
1 answer

Is there a Unix command that searches for similar strings, based mostly on how they sound when spoken?

I have a file of names, and I want to search within it, not caring too much about whether I have spelled the name ( that I am searching for ) correctly. I know that grep has quite a bit of functionality to search for a whole slew of similar strings…
gabkdlly
  • 173
  • 5
4
votes
1 answer

How to make this conky (Conky Vision) use other language than English?

I want to use this conky script: Conky Vision But I don't want the days of the week to be displayed in English. When I change my locale to another language, the day of today is displayed in that language but the 5-day names from the lower part of…
user32012
4
votes
3 answers

Frequency of words in non-English language text: how can I merge singular and plural forms etc.?

I'm sorting French language words in some text files according to frequency with a focus on insight rather than statistical significance. The challenge is about preserving accented characters and dealing with the article forms in front of vowels(l',…
user44370
2
votes
1 answer

Text prediction in Linux while typing like on Android, iOS and Windows

I have noticed the utility of this feature while typing on Android devices (notably with Gboard virtual keyboard) The same is available on iOS, in iPhones and iPads. I don't mean the use of a virtual keyboard on Linux, but the presence of a…
1
vote
1 answer

Linux Mint doesn't write Arabic letters

I installed arabic fonts for Linux mint and i can switch between arabic and English, but it seems that mint cannot write arabic letters for example renaming a file or writing in any text-editor, when typing in arabic nothing appears. Note that it…
1
vote
2 answers

Where can I find a dictionary file of common words?

It's easy to generate a strong password quickly using the system dictionary: $ for i in {1..4}; do shuf --head-count=1 /usr/share/dict/words; done Amelanchier whitecup ankhs antispasmodics However, this isn't exactly the easiest list of words to…
l0b0
  • 50,672
  • 41
  • 197
  • 360
0
votes
1 answer

exp@@glibc_2.2.5 undefined reference

I've been trying to use the SVM-python implementation provided at tfinley.net. When I use the command make on the extracted directory, /usr/bin/ld: svm_light/svm_common.o: undefined reference to symbol…
AdveRSAry
  • 101
  • 1
  • 4
0
votes
1 answer

Indian Languages not available on Libreoffice Impress 5

I need to use Indian Language 'Marathi' to prepare a presentation. But there is no Marathi or Hindi to set as document language. Probably therefore even when I try to change the properties (e.g., font size) of the Marathi text I enter, things happen…
0
votes
1 answer

How can I translate in the CLI an English word into a German one?

I want to write a Script that picks a random English word from /usr/share/dict/words, translates it into German, display both of them for a certain amount of time and repeat the process. I only know the beginning part and do not know how to use a…
Abdul Al Hazred
  • 25,760
  • 23
  • 64
  • 88