1 vote
0 answers
62 views

I'm trying in vain to make NLTK show a Frequency Distribution Plot using FreqDist.plot(). I've followed the steps from the first chapter of the NLTK book (scroll down to "3.1 Frequency ...
tohuwawohu's user avatar
  • 13.8k
0 votes
1 answer
393 views

I'm attempting to perform Part-of-Speech (POS) tagging in a Jupyter Notebook using the NLTK package. The averaged_perceptron_tagger resource cannot be located, according to the LookupError I keep ...
Aryan Rajyaguru's user avatar
3 votes
1 answer
99 views

The code below tokenises the text and identifies the grammar of each tokenised word. import nltk from nltk.tokenize import sent_tokenize, word_tokenize from nltk.corpus import wordnet as wn #nltk....
Ali's user avatar
  • 31
0 votes
1 answer
91 views

I am attempting to create a Python transform that requires me to import nltk. When I import nltk, later on I get: Resource punkt_tab not found. Please use the NLTK Downloader to obtain the ...
omnes_flumina's user avatar
0 votes
1 answer
41 views

I have the following trained classifier: classifier = nltk.NaiveBayesClassifier.train(features[:train_count]) When I try to use it to classify(): result = classifier.classify(feature) and feature is ...
khteh's user avatar
  • 4,290
0 votes
2 answers
207 views

Repeated LookupError even though NLTK is downloaded: Resource punkt_tab not found. Please use the NLTK Downloader to obtain the resource: >>> import nltk nltk.download('...
Ellster's user avatar
0 votes
1 answer
118 views

Sometimes, a Python library depends on additional data, such as ML models. This could be a model from transformers, spacy, nltk, and so on. Typically there is a command to download such a model: python -...
jdm's user avatar
  • 10.4k
1 vote
1 answer
93 views

How do I get rid of non-printing (escaped) characters from the output of the nltk.word_tokenize method? I am working through the book 'Natural Language Processing with Python' and am following the ...
green_ruby's user avatar
0 votes
0 answers
61 views

When tokenizing paragraphs to sentences in the Russian language, I am observing the special case when the sequence is not treated as the end of the sentence. The case is with the я. at the end of the ...
pepr's user avatar
  • 21.1k
0 votes
0 answers
49 views

When tokenizing paragraphs in the Czech language, I am observing that some abbreviations are not treated as abbreviations. The paragraph is stored in the file as one long line. The nltk is of the ...
pepr's user avatar
  • 21.1k
1 vote
0 answers
34 views

Title: ValueError: could not convert string to float when training GaussianNB for SMS Spam Detection Body: I'm building an SMS spam detection tool and encountering an error while predicting with a ...
Aditya Kumar's user avatar
-1 votes
2 answers
2k views

I’m trying to perform Named Entity Recognition (NER) using NLTK, SpaCy, and a dataset in PyCharm. However, I’m encountering an error related to a missing resource (punkt_tab) when tokenizing text. ...
Nurul Zulaiqha's user avatar
0 votes
0 answers
190 views

I have been trying to use nltk's entity chunker, and tried different approaches but I keep getting the error: LookupError Traceback (most recent call last) ... ...
Sarah Tomori's user avatar
0 votes
1 answer
64 views

I have installed nltk through cmd on Windows 10, but when I try to import it in IDLE, it does not work. Can someone please help? ...
Sahil Patel's user avatar
2 votes
2 answers
117 views

I was trying to understand how BLEU score works and noticed that if I had to compute the n-gram precisions and have multiple reference sentences, it makes more sense to turn everything into sets to ...
Yuirike's user avatar
  • 41
