Newest 'perplexity' Questions

0 votes

0 answers

81 views

Perplexity User Location filter dont work

I have a simple question for people used to Perplexity, this is what appears in the perplexity documentation: https://docs.perplexity.ai/guides/user-location-filter-guide#examples import requests url ...

Udl David

55

asked Jun 7, 2025 at 20:20

0 votes

0 answers

72 views

LDA perplexity with train-test split leads to absurd results (best model = 1 topic)

I'm working with LDA on a Portuguese news corpus (~800k documents with an average of 28 words each after cleaning the data), and I’m trying to evaluate topic quality using perplexity. When I compute ...

O Basile

1

asked Apr 15, 2025 at 0:02

1 vote

1 answer

401 views

Test a Perplexity API key on python

I don't find an efficient way to check if an api key of perplexity is valid, either on python or anything. Indeed, for openai I do : def check_openai_api_key(api_key): openai.api_key = api_key ...

Digicem

31

asked Dec 12, 2024 at 18:56

0 votes

1 answer

96 views

Why is Perplexity not reliable for open domain text generation tasks?

In the paper here, it says that perplexity as an automated metric is not reliable for open domain text generation tasks, but it instead uses lm-score, a model based metric to produce perplexity like ...

Sahil Yerawar

65

asked Jun 16, 2024 at 2:23

2 votes

2 answers

2k views

How to get perplexity per token rather than average perplexity?

I can get the perplexity of a whole sentence from here: device = "cuda" from transformers import GPT2LMHeadModel, GPT2TokenizerFast device = "cuda" model_id = "gpt2" ...

Penguin

2,651

asked Nov 6, 2023 at 17:30

1 vote

0 answers

319 views

Perplexity metric for GPT2 model is lower for non-English text

I am currently working on a project of calculating perplexities of various causal LLMs for different languages to estimate their behaviour if there is an input in a form of the language, that ...

Nikita Volkov

11

asked Sep 11, 2023 at 16:05

1 vote

1 answer

557 views

Challenges when calculating perplexity: using bidirectional models, and dealing with large text size and values, are my approaches reasonable?

Challenges when calculating perplexity: is my approach reasonable? I am trying to find a pre-trained language model that will work best for my text. The text is pretty specific in its language and ...

Agnes

29

asked Jun 4, 2023 at 10:05

1 vote

1 answer

848 views

Why is perplexity calculation giving different results for the same input?

I'm following Huggingface doc on calculating the perplexity of fixed-length models. I'm trying to verify that the formula works for various strings and I'm getting odd behavior. In particular, they ...

Penguin

2,651

asked May 6, 2023 at 2:41

0 votes

1 answer

134 views

What is the held-out probability in Mallet LDA? How can we calculate Perplexity by the held-out probability?

I am new to mallet. Now I would like to get the perplexity scores for 10-100 topics in my lda model so I run the held-our probability, it gives me the value of -8926490.73103205 for topic=100, which ...

May3514

1

asked Oct 31, 2022 at 23:50

2 votes

1 answer

1k views

How to calculate perplexity of BERTopic?

Is there a way to calculate the perplexity of BERTopic? I am unable to find any such thing in the BERTopic library and in other places.

Inaam Ilahi

124

asked Aug 16, 2022 at 6:29

1 vote

1 answer

644 views

Large Language Model Perplexity

i am currently using GPT-3 and i am trying to compare its capabilities to related language models for my masters thesis. Unfortunatly GPT-3 is an API based application, so i am not really able to ...

Fabian

83

asked Mar 31, 2022 at 9:41

3 votes

0 answers

239 views

Diagnostics (perplexity, LogLik, etc) for LDA topic model with textmodel_seededLDA package in R

I'm using the seededLDA package to do an LDA topic model. However, all of the packages and functions I've found to compute perplexity, log likelihood, exclusivity, etc (and other diagnostic tools) don'...

Daniel Casey

53

asked Nov 2, 2021 at 22:08

0 votes

2 answers

3k views

How to find perplexity of bigram if probability of given bigram is 0

Given the formula to calculate the perplexity of a bigram (and probability with add-1 smoothing), Probability How does one proceed when one of the probabilities of the word per in the sentence to ...

axelmukwena

1,159

asked Mar 31, 2021 at 14:55

2 votes

0 answers

438 views

Why am I randomly getting super high perplexities?

A few days ago I finished writing a word prediction program that tests both LSTM and GRU models on a given dataset. I test 4 models - 2 LSTM models and 2 GRU models. I wrote the program on Google ...

Guy

163

asked Dec 9, 2020 at 22:20

0 votes

0 answers

756 views

Gensim perplexity score increases

I am trying to calculate the perplexity score in Spyder for different numbers of topics in order to find the best model parameters with gensim. However, the perplexity score is not decreasing as it is ...

blackmamba

15

asked Sep 21, 2020 at 18:46

Collectives™ on Stack Overflow

Perplexity User Location filter dont work

LDA perplexity with train-test split leads to absurd results (best model = 1 topic)

Test a Perplexity API key on python

Why is Perplexity not reliable for open domain text generation tasks?

How to get perplexity per token rather than average perplexity?

Perplexity metric for GPT2 model is lower for non-English text

Challenges when calculating perplexity: using bidirectional models, and dealing with large text size and values, are my approaches reasonable?

Why is perplexity calculation giving different results for the same input?

What is the held-out probability in Mallet LDA? How can we calculate Perplexity by the held-out probability?

How to calculate perplexity of BERTopic?

Large Language Model Perplexity

Diagnostics (perplexity, LogLik, etc) for LDA topic model with textmodel_seededLDA package in R

How to find perplexity of bigram if probability of given bigram is 0

Why am I randomly getting super high perplexities?

Gensim perplexity score increases

Hot Network Questions