The Hidden Truth About LLM Performance: Why Your Benchmark Results Might Be Misleading
Published:
There are two systematic issues with benchmarks, which were already present in the community before the advent of LLMs but became worse …
Published:
There are two systematic issues with benchmarks, which were already present in the community before the advent of LLMs but became worse …
Published:
Today, Google launched a new set of models named Gemma. These models are based on the same tech and research used for creating the Gemini…
Published:
Napolab is here: a curated collection of Portuguese datasets designed for easy evaluation of language models.
Published:
Hashtag segmentation, the task of adding spaces between words in a hashtag, can now be done with Large Language Models (LLMs).
Published:
It’s no secret that the tech landscape is dynamic and ever-evolving. New technologies are born, they mature, and then, often, they are…
Published:
Abusive language detection, a critical aspect of modern NLP research, is often challenged by the lack of generalization across different…
Published:
I am thrilled to announce the latest milestone in the advancement of Portuguese language technology — the Albertina PT ! This breakthrough…
Published:
Word segmentation is the task of adding spaces between words. It can be an important preprocessing step in Natural Language Processing…
Published:
The cold start problem in NLP:
Published:
A calendar in the Wake
Published:
Update 03/21/2021: I published my modified version of run_glue.py as a public gist on GitHub.