Advice
0 votes
1 replies
44 views

We’re working on a problem where our master dataset contains names of organizations and individuals, but some entries are untagged. We only have the names (no additional details such as email or ...
MJ17's user avatar
  • 109
0 votes
1 answer
186 views

I’m working on anonymizing PII in banking text using Microsoft Presidio. The built-in PERSON recognizer (which uses spaCy under the hood) works for some Western names and when the sentence is clear ...
karthik's user avatar
1 vote
1 answer
61 views

I have been struggling with the concept of TP, TN, FP, FN. Here is a text: "My age is 25 years old, I am born on 20/02 of 2000". Let's say we have an NLP model (BERT) that extracts the year you were born. In ...
Solal Peiffer-Smadja's user avatar
0 votes
1 answer
71 views

Dataset structure. Can somebody help me convert the data into spaCy format for an NER model? The dataset format is shown in the screenshot here (https://www.kaggle.com/datasets/naseralqaydeh/...
Rohit Gupta's user avatar
0 votes
0 answers
63 views

This code is part of an API endpoint that processes a user's query in a conversational system. It tracks the execution time for each step, including fetching user data, refining the query based on ...
Tanuj Kanda's user avatar
0 votes
0 answers
51 views

I'm working on a relation extraction task using a transformer-based model. The pipeline is expected to extract entity pairs along with their relation labels. When I run the evaluation ...
user29424767's user avatar
-1 votes
2 answers
2k views

I’m trying to perform Named Entity Recognition (NER) using NLTK, spaCy, and a dataset in PyCharm. However, I’m encountering an error related to a missing resource (punkt_tab) when tokenizing text. ...
Nurul Zulaiqha's user avatar
3 votes
1 answer
206 views

I want to run an entity linking job using only a custom knowledge base, without the second-step ML re-ranker that requires a training dataset / spaCy corpus. I want the NEL pipeline to only assign ...
lrthistlethwaite's user avatar
1 vote
1 answer
167 views

I have been using the Named Entity Recognition (NER) model https://huggingface.co/cahya/bert-base-indonesian-NER on Indonesian text as follows: text = "..." model_name = "cahya/bert-...
Mauro Escudero's user avatar
2 votes
1 answer
65 views

I am working on linking short texts to entities in a biomedical knowledge graph (UMLS CUIs) using SciSpacy for a research project. The goal is to analyze the relationship between the linked entity and ...
GrimSqueaker's user avatar
0 votes
1 answer
57 views

I am using a pre-trained BertForTokenClassification for a nested Named Entity Recognition task. To define nested entities, I am using a multi-label method. In the output, the model returns 3 lists of logits, ...
Alexandr Duck's user avatar
0 votes
1 answer
275 views

How to derive attributes/labels from short plain text descriptions? (NER, LLM, ?) I have short product descriptions that I’d like to transform into structured attributes. Example: Input: “La Lecciaia ...
Jeff Em's user avatar
  • 13
1 vote
0 answers
108 views

I want to replace the XLM-RoBERTa-Base prediction head for a NER task (9 tags) with the weights for very specific words. For example, for the NER tag "B-PER" I would like to use the weights ...
Martin's user avatar
  • 11
0 votes
1 answer
597 views

I have created an NER model using BERT to detect medical entities, which works great. I'm trying to add a CRF layer on top of my BERT model to improve its performance, but I'm getting an error that I ...
Akram H's user avatar
  • 71
2 votes
0 answers
140 views

I'm trying to process a CV-like text, more precisely to split it into parts by their meaning (Description, Contacts, Experience, Education, Certifications, etc.). Would NER be suitable for this purpose (...
Andrei Yusupau's user avatar
