Newest 'named-entity-recognition' Questions

Advice

0 votes

1 replies

44 views

Organisation/Person tagging using Spacy

We’re working on a problem where our master dataset contains names of organizations and individuals, but some entries are untagged. We only have the names (no additional details such as email or ...

MJ17

109

asked Nov 12, 2025 at 17:03

0 votes

1 answer

186 views

How to make Microsoft Presidio detect and mask Indian names and unusual text patterns in banking data?

I’m working on anonymizing PII in banking text using Microsoft Presidio . The built-in PERSON recognizer (which uses spaCy under the hood) works for some Western names and when the sentence is clear ...

karthik

9

asked Sep 24, 2025 at 8:34

1 vote

1 answer

61 views

What does True positive, FP, TN, FN corresponds when you do NER (Named Entity Recognition) in NLP?

I have been struggling with the concept of TP, TN, FP, FN Here is a text : My age is 25 years old, I am born on 20/02 of 2000 Let's say we have a nlp model (BERT) that extract the year you are born In ...

Solal Peiffer-Smadja

39

asked Jul 31, 2025 at 9:24

0 votes

1 answer

71 views

Converting data into spacy format "convert_to_spacy_format" in Name entity recognition Model

Dataset structureCan somebody help me with the NER model in converting the data into spacy format. The dataset format is shown in the screenshot here (https://www.kaggle.com/datasets/naseralqaydeh/...

Rohit Gupta

1

asked Mar 19, 2025 at 17:43

0 votes

0 answers

63 views

NER Results Empty in Query Processing Despite Parallel Intent and Entity Extraction (Showing empty NER Results)

This code is part of an API endpoint that processes a user's query in a conversational system. It tracks the execution time for each step, including fetching user data, refining the query based on ...

Tanuj Kanda

11

asked Mar 4, 2025 at 11:17

0 votes

0 answers

51 views

Relation Extraction Model returns only one entity instead of entity pairs

I'm working on a relation extraction model task using a transformer-based model. the `pipeline is expected to extract entity pairs along with their labelled relation labels. When I run the evaluation ...

user29424767

1

asked Jan 31, 2025 at 3:57

-1 votes

2 answers

2k views

Why am I getting a LookupError: Resource punkt_tab not found in NLTK even after downloading punkt?

I’m trying to perform Named Entity Recognition (NER) using NLTK, SpaCy, and a dataset in PyCharm. However, I’m encountering an error related to a missing resource (punkt_tab) when tokenizing text. ...

Nurul Zulaiqha

1

asked Jan 10, 2025 at 17:16

3 votes

1 answer

206 views

Attaching custom KB to Spacy "entity_linker" pipe makes NER calls very poor

I want to run an entity linking job using a custom Knowledgebase alone, and not use the second step ML re-ranker that requires a training dataset / Spacy corpus. I want the NEL pipeline to only assign ...

lrthistlethwaite

554

asked Dec 18, 2024 at 19:29

1 vote

1 answer

167 views

How to use Hugging Face model with 512 max tokens on longer text (for Named Entity Recognition)

I have been using the Named Entity Recognition (NER) model https://huggingface.co/cahya/bert-base-indonesian-NER on Indonesian text as follows: text = "..." model_name = "cahya/bert-...

Mauro Escudero

11

asked Nov 20, 2024 at 15:03

2 votes

1 answer

65 views

Handling Multiple Entity Candidates in Short Texts for Entity Linking with SciSpacy

I am working on linking short texts to entities in a biomedical knowledge graph (UMLS CUIs) using SciSpacy for a research project. The goal is to analyze the relationship between the linked entity and ...

GrimSqueaker

432

asked Nov 7, 2024 at 8:52

0 votes

1 answer

57 views

OutOfMemory while training pre-trained BERT model for token classification task

I am using pre-trained BertForTokenClassification for nested Named Entities Recognition task. To define nested entities, I am using multi-labels method. In the output model returns 3 lists of logits, ...

Alexandr Duck

1

asked Oct 25, 2024 at 17:45

0 votes

1 answer

275 views

How to derive attributes/labels from short plain text descriptions? (NER, LLM, ?)

How to derive attributes/labels from short plain text descriptions? (NER, LLM, ?) I have short product descriptions that I’d like to transform into structured attributes. Example: Input: “La Lecciaia ...

Jeff Em

13

asked Oct 21, 2024 at 20:54

1 vote

0 answers

108 views

Replace Prediction Head for XLM-RoBERTa-Base

I want to replace the XLM-RoBERTa-Base prediction head for a NER task (9 tags) with the weights for very specific words. For example, for the NER tag "B-PER" I would like to use the weights ...

Martin

11

asked Oct 9, 2024 at 3:21

0 votes

1 answer

597 views

How do I add a CRF layer to a BERT model for NER tasks?

I have created an NER model using BERT to detect medical entities which works great. I'm trying to add a CRF layer on top of my BERT model to enhance its performances but I'm getting an error that I ...

Akram H

71

asked Sep 25, 2024 at 12:20

2 votes

0 answers

140 views

Using NER to label big parts of text

I'm trying to process a CV-like text, more exactly to split it into parts by their meaning (Description, Contacts, Experience, Education, Certifications etc). Would NER be suitable for this purpose (...

Andrei Yusupau

654

asked Sep 8, 2024 at 10:11

Collectives™ on Stack Overflow

Organisation/Person tagging using Spacy

How to make Microsoft Presidio detect and mask Indian names and unusual text patterns in banking data?

What does True positive, FP, TN, FN corresponds when you do NER (Named Entity Recognition) in NLP?

Converting data into spacy format "convert_to_spacy_format" in Name entity recognition Model

NER Results Empty in Query Processing Despite Parallel Intent and Entity Extraction (Showing empty NER Results)

Relation Extraction Model returns only one entity instead of entity pairs

Why am I getting a LookupError: Resource punkt_tab not found in NLTK even after downloading punkt?

Attaching custom KB to Spacy "entity_linker" pipe makes NER calls very poor

How to use Hugging Face model with 512 max tokens on longer text (for Named Entity Recognition)

Handling Multiple Entity Candidates in Short Texts for Entity Linking with SciSpacy

OutOfMemory while training pre-trained BERT model for token classification task

How to derive attributes/labels from short plain text descriptions? (NER, LLM, ?)

Replace Prediction Head for XLM-RoBERTa-Base

How do I add a CRF layer to a BERT model for NER tasks?

Using NER to label big parts of text

Hot Network Questions