1,447 questions
Advice
0
votes
1
replies
44
views
Organisation/Person tagging using spaCy
We’re working on a problem where our master dataset contains names of organizations and individuals, but some entries are untagged. We only have the names (no additional details such as email or ...
0
votes
1
answer
186
views
How to make Microsoft Presidio detect and mask Indian names and unusual text patterns in banking data?
I’m working on anonymizing PII in banking text using Microsoft Presidio.
The built-in PERSON recognizer (which uses spaCy under the hood) works for some Western names and when the sentence is clear
...
1
vote
1
answer
61
views
What do true positive (TP), FP, TN, and FN correspond to when you do NER (Named Entity Recognition) in NLP?
I have been struggling with the concepts of TP, TN, FP, and FN.
Here is a text:
My age is 25 years old, I am born on 20/02 of 2000
Let's say we have an NLP model (BERT) that extracts the year you were born.
In ...
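For readers skimming this question, here is a minimal sketch of how these counts are often computed at the entity level. It assumes exact-match span evaluation, a hypothetical `YEAR` label, and a made-up prediction in which the model also tags the age by mistake; none of this comes from the original post. Under this convention true negatives are usually not counted, because the set of "spans that are not entities" is not well defined.

```python
from __future__ import annotations

from typing import NamedTuple


class Span(NamedTuple):
    """A labeled character span: text[start:end] has the given label."""
    start: int
    end: int
    label: str


def confusion_counts(gold: set[Span], predicted: set[Span]) -> dict[str, int]:
    """Exact-match entity-level counts (TN is deliberately not computed)."""
    return {
        "tp": len(gold & predicted),   # predicted spans that match gold exactly
        "fp": len(predicted - gold),   # predicted spans with no gold match
        "fn": len(gold - predicted),   # gold spans the model missed
    }


text = "My age is 25 years old, I am born on 20/02 of 2000"
year_start = text.index("2000")
age_start = text.index("25")

# Gold annotation: only "2000" is the birth year.
gold = {Span(year_start, year_start + 4, "YEAR")}

# Hypothetical model output: finds the year, but also tags the age as a year.
predicted = {
    Span(year_start, year_start + 4, "YEAR"),
    Span(age_start, age_start + 2, "YEAR"),
}

counts = confusion_counts(gold, predicted)
precision = counts["tp"] / (counts["tp"] + counts["fp"])
recall = counts["tp"] / (counts["tp"] + counts["fn"])
print(counts, precision, recall)
```

Token-level evaluation is a different convention: there every token tagged `O` in both gold and prediction would count as a true negative.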
0
votes
1
answer
71
views
Converting data into spaCy format ("convert_to_spacy_format") for a named entity recognition model
Can somebody help me convert the data into spaCy format for an NER model?
The dataset format is shown in the screenshot here (https://www.kaggle.com/datasets/naseralqaydeh/...
0
votes
0
answers
63
views
NER Results Empty in Query Processing Despite Parallel Intent and Entity Extraction (Showing empty NER Results)
This code is part of an API endpoint that processes a user's query in a conversational system. It tracks the execution time for each step, including fetching user data, refining the query based on ...
0
votes
0
answers
51
views
Relation Extraction Model returns only one entity instead of entity pairs
I'm working on a relation extraction task using a transformer-based model. The pipeline is expected to extract entity pairs along with their relation labels. When I run the evaluation ...
-1
votes
2
answers
2k
views
Why am I getting a LookupError: Resource punkt_tab not found in NLTK even after downloading punkt?
I’m trying to perform Named Entity Recognition (NER) using NLTK, spaCy, and a dataset in PyCharm. However, I’m encountering an error related to a missing resource (punkt_tab) when tokenizing text. ...
3
votes
1
answer
206
views
Attaching custom KB to spaCy "entity_linker" pipe makes NER calls very poor
I want to run an entity linking job using a custom knowledge base alone, and not use the second-step ML re-ranker that requires a training dataset / spaCy corpus. I want the NEL pipeline to only assign ...
1
vote
1
answer
167
views
How to use Hugging Face model with 512 max tokens on longer text (for Named Entity Recognition)
I have been using the Named Entity Recognition (NER) model https://huggingface.co/cahya/bert-base-indonesian-NER on Indonesian text as follows:
text = "..."
model_name = "cahya/bert-...
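One common way to approach this kind of limit is to split the token sequence into overlapping windows, run the model on each window, and merge the predictions using the window offsets. The sketch below is illustrative only: it works on a plain list of tokens, and the function name `sliding_windows` and the default `stride` of 128 are assumptions, not something from the original post.

```python
from __future__ import annotations


def sliding_windows(
    tokens: list, max_len: int = 512, stride: int = 128
) -> list[tuple[int, list]]:
    """Split tokens into windows of at most max_len items.

    Consecutive windows overlap by `stride` tokens so that an entity cut
    at one window's edge appears whole in the neighbouring window.
    Returns (offset, window) pairs; offset is the window's start index.
    """
    if not 0 <= stride < max_len:
        raise ValueError("stride must satisfy 0 <= stride < max_len")

    step = max_len - stride
    windows = []
    start = 0
    while True:
        end = min(start + max_len, len(tokens))
        windows.append((start, tokens[start:end]))
        if end == len(tokens):
            break
        start += step
    return windows


# Stand-in for a tokenized document of 1000 tokens.
tokens = list(range(1000))
windows = sliding_windows(tokens, max_len=512, stride=128)
offsets = [offset for offset, _ in windows]
print(offsets, [len(w) for _, w in windows])
```

With a real BERT-style model the usable window is a little shorter than 512, since special tokens such as `[CLS]` and `[SEP]` also count toward the limit. Hugging Face tokenizers can produce overlapping windows themselves via `return_overflowing_tokens=True` together with `stride`, which may be simpler than chunking by hand.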
2
votes
1
answer
65
views
Handling Multiple Entity Candidates in Short Texts for Entity Linking with SciSpacy
I am working on linking short texts to entities in a biomedical knowledge graph (UMLS CUIs) using SciSpacy for a research project. The goal is to analyze the relationship between the linked entity and ...
0
votes
1
answer
57
views
OutOfMemory while training pre-trained BERT model for token classification task
I am using a pre-trained BertForTokenClassification for a nested named entity recognition task. To define nested entities, I am using a multi-label method. The model outputs 3 lists of logits, ...
0
votes
1
answer
275
views
How to derive attributes/labels from short plain text descriptions? (NER, LLM, ?)
I have short product descriptions that I’d like to transform into structured attributes.
Example:
Input:
“La Lecciaia ...
1
vote
0
answers
108
views
Replace Prediction Head for XLM-RoBERTa-Base
I want to replace the XLM-RoBERTa-Base prediction head for an NER task (9 tags) with the weights for very specific words. For example, for the NER tag "B-PER" I would like to use the weights ...
0
votes
1
answer
597
views
How do I add a CRF layer to a BERT model for NER tasks?
I have created an NER model using BERT to detect medical entities, which works great. I'm trying to add a CRF layer on top of my BERT model to improve its performance, but I'm getting an error that I ...
2
votes
0
answers
140
views
Using NER to label big parts of text
I'm trying to process a CV-like text, more precisely to split it into parts by their meaning (Description, Contacts, Experience, Education, Certifications, etc.).
Would NER be suitable for this purpose (...