natural language processing
PulseAugur coverage of natural language processing — every cluster mentioning natural language processing across labs, papers, and developer communities, ranked by signal.
- instance of named-entity recognition 90%
- used by Eugene Yan 70%
- used by Word2vec 70%
- used by electronic health records 70%
- affiliated with deep learning 70%
- instance of deep learning 70%
- used by named-entity recognition 70%
- instance of Gotit.pub 70%
- instance of ScienceCast 60%
- instance of alphaXiv 60%
- instance of CatalyzeX 60%
- instance of visual perception 60%
22 day(s) with sentiment data
-
AI automates property data extraction from real estate documents
Artificial intelligence is revolutionizing the real estate industry by automating the extraction of property information from various documents. Techniques such as Optical Character Recognition (OCR), Natural Language P…
-
New paper highlights "Annotation Scarcity Paradox" in low-resource NLP evaluation
A new paper published on arXiv discusses the "Annotation Scarcity Paradox" in low-resource Natural Language Processing (NLP). The paper argues that while NLP models have advanced rapidly, the human infrastructure needed…
-
Sophia NLU Home Assistant edition v1.3 enhances LLM-like understanding
Sophia NLU Home Assistant edition v1.3 has been released, offering improved understanding and a more fluid conversational experience. This update introduces a new 'Context Only' mode, designed to enhance compatibility a…
-
New ReTeX framework recovers task expert performance from merged AI models
Researchers have developed a new framework called ReTeX to address parameter interference in multi-task model merging. This method models interference as additive offsets and predicts these offsets to recover individual…
-
Study reveals 18.6% of online reviews show rating-sentiment incongruence · 3 sources tracked
A recent study published on arXiv investigated the discrepancy between star ratings and written sentiment in online reviews, particularly within Sri Lankan tourism. The research found that 18.6% of reviews exhibit this …
-
MARBERT model enhances Arabic tweet analysis for STC customer service
Researchers have developed a new method for sentiment and spam detection in Arabic tweets using the MARBERT model. This approach aims to improve customer service for Saudi Telecom Company (STC) by analyzing feedback on …
-
LLM evaluation metrics need confidence intervals to distinguish signal from noise
Evaluating Large Language Models (LLMs) requires understanding the uncertainty inherent in performance metrics. A single score, such as 84.2% accuracy, can be misleading because it doesn't account for sampling error. By…
-
New system measures academic paper difficulty and its link to impact
A new research paper proposes a system to quantitatively assess the difficulty of academic papers, particularly within Natural Language Processing (NLP). The system incorporates factors like collaboration, content, and …
-
New LLM system and dataset enhance product data extraction for Portuguese e-commerce
Researchers have developed AI-PAVE-Br, a system utilizing large language models to improve Product Attribute Value Extraction (PAVE) for Portuguese e-commerce data. This system is designed to handle the complexities and…
-
New algorithm tags Arabic-English dictionary senses using WordNet
Researchers have developed a novel algorithm for automatically assigning part-of-speech (POS) tags to senses within an Arabic-English bilingual dictionary. This method leverages Princeton WordNet to transfer POS tags fr…
-
New Czech language treebanks released for NLP research · 4 sources tracked
Researchers have released two new papers detailing advancements in Czech language processing resources. The first paper introduces the Prague Dependency Treebank -- Consolidated 2.0 (PDT-C 2.0), an extensive, uniformly …
-
New study analyzes algorithm influence using NLP paper co-occurrence networks
This research paper introduces a novel method for analyzing the academic influence of algorithms within the field of Natural Language Processing (NLP). By constructing large-scale co-occurrence networks from the full te…
-
Gender diversity in research teams linked to higher scientific impact
A new study published on arXiv explores the correlation between gender diversity in research teams and the scientific impact of their papers, specifically within the Natural Language Processing (NLP) and Library and Inf…
-
Fine-tuning LLMs: Data Labeling and Failure Analysis Are Key Challenges
Fine-tuning large language models offers a more efficient alternative to training from scratch, allowing users to adapt pre-existing models to specific tasks. However, the most challenging aspect of fine-tuning is not t…
-
New model learns annotator perspectives for moral text classification
Researchers have developed a new approach to moral classification of text by modeling individual annotator perspectives rather than relying on aggregated "ground truth" labels. This method extends pretrained language mo…
-
New method merges AI models without training for better performance
Researchers have developed a novel training-free method for merging multiple task-specific AI models into a single, more efficient multi-task model. This new approach, called SiM, uses singular value decomposition to ap…
-
New method ROMEVA improves Roman Urdu language model vocabulary
Researchers have developed ROMEVA, a novel method for expanding the vocabulary of multilingual language models like mBERT to better handle languages with inconsistent spelling, such as Roman Urdu. This approach combines…
-
Natural Language Processing powers AI and communication
Natural Language Processing (NLP) is the technology that enables devices to understand and generate human language, powering features like voice assistants and AI text generation. This field is fundamentally altering ho…
-
New corpus maps scientific research evolution with typed citations · 1 source tracked
Researchers have developed a new corpus called SciTraj to better understand how scientific research evolves by analyzing citation patterns. Unlike traditional citation graphs, SciTraj categorizes citation edges into six…
-
AI language tech needs plural epistemologies, not just data, study finds
A new paper proposes a socio-technical model for integrating cultural diversity into Natural Language Processing (NLP). The research argues that achieving cultural alignment requires more than just adding diverse data; …