Latest articles and news
I write about Data, Machine Learning, Code, Projects or Ideas
- Panama Papers Investigation using Entity Resolution and Entity Linking
This article demonstrates how developers or investigative journalists can use Senzing entity resolution (ER) to work with unstructured documents. In particular, once ER has been used on structured data sources to construct a domain-specific KG, its results can be leveraged to customise entity linking (EL) downstream, for example using spaCy, as an alternative to off-the-shelf EL sources such as DBpedia.
- #NLP
- (non) Alternatives to dbt
During this talk, I’ll be sharing tips to effectively “sell” dbt and the modern data stack, and insights on the current landscape of data transformation tools.
- #Projects
- #Data
- Reviewing my first research paper on EasyChair
As a PC member for SofLiM24 (ISWC24), I have peer-reviewed my first research paper. The title: "Dynamic ontology serialization using ontology mapping paths".
- #Projects
- #NLP
- Functional Programming for Pandas Data Engineering
In this post I give tips on how to write pandas code for Data Engineering.
- #Data
- NER models in Argilla
In this talk, I explain how to use Argilla for Named Entity Recognition (NER). We will look at a real-life practical example on a dataset of football news. Argilla is a fast-moving open-source data curation platform. It can be used across the MLOps lifecycle, for data labelling and model monitoring. It is tightly integrated with Hugging Face and is built on top of Elasticsearch and FastAPI.
- #Projects
- #NLP
- Graphs and Language
In this blog post, we will explore how Knowledge Graphs can benefit from LLMs, and vice versa.
- #NLP
- #Model
- #Projects
- How To Build and Interpret a Nomogram for Setting Better Running Goals
In this article, I share a nomogram I built to help me set better running goals. It can help you set goals that are both difficult and specific.
- #Data
- When Natural Language Processing (NLP) meets Football
Louis Guitton, Principal Machine Learning Engineer at OneFootball: Scaling football news ingestion with Graph-based Natural Language Processing
- #Projects
- #Model
- #NLP
- A lightweight alternative to Amundsen for your dbt project
In this post, I'll show you how to build a lightweight data catalog on top of dbt artifacts using Algolia.
- #Data
- How to parse dbt artifacts
In this post, I'll show you how to get started with dbt artifacts, and how to parse them to unlock applications valuable to your use case.
- #Data
- How to monitor your FastAPI service
In this post I explain how to get visibility into your FastAPI application's performance for both development and production environments.
- #Code
- Learn SPARQL in 5 minutes and use it to query WikiData
Learn how to formulate questions to WikiData using SPARQL.
- #Data