Article List
Explore the latest news, discover interesting content, and dive deep into the topics that interest you
Prozessvisualisierung mit generativer KI im Praxistest (Process Visualization with Generative AI: A Practical Test)
German article by Nils Durner on visualizing technical processes with Generative AI, featuring spaCy and Presidio for PII anonymization....
What the history of the web can teach us about the future o…
How will AI development look in the future? There is a lot we can learn from another groundbreaking technology: the web. This blog post takes a look a...
What the history of the web can teach us about the future o…
In this talk, Ines takes a look at what the history of the web can teach us about the future of AI, and what this means for developers, models, open s...
Using natural language processing to identify emergency dep…
CT reports were annotated by MD raters using Prodigy software to develop a stepwise NLP “pipeline” that first excluded prior or known malignancy, dete...
Best Way to OCR a PDF in Python
Tutorial by WJB Mattingly on how to use the new spaCy Layout package and Docling to convert PDFs to text....
Streaming spaCy
Join spaCy author and core developer Matt as he works on the library, develops features and fixes bugs, while chatting about all things NLP and open s...
Prodigy Dashboard Plugin
The new dashboard plugin adds a web application for managing annotations, data analytics and annotation progress, and is now available for early beta...
Cracking the Code: How to Start a Career in AI
Short video interview with Ines about the 4 skills job hunters can cultivate for a career in artificial intelligence....
spaCy Natural Language Processing: From Beginner to Advanced
The first Chinese-language book on spaCy for beginners and experienced practitioners, covering traditional NLP techniques and how to leverage LLMs for...
PyLadies entrepreneurs and career development
Panel discussion about career challenges and starting your own business with Cheuk Ting Ho, Tereza Iofciu, Anwesha Das, Una Galyeva and Ines....
Recognising non-named spatial entities in literary texts: a…
In this paper, we present a case study on the prediction of what we call ‘non-named spatial entities’ (NNSE) in a historical corpus of Swiss-German no...
From PDFs to AI-ready structured data: a deep dive
This blog post presents a new modular workflow for converting PDFs and similar documents to structured data and shows you how to build end-to-end docu...
🔌 prodigy-pdf v0.4.0
Add text-based span annotation for PDFs...
✨ prodigy v1.17.0
Pages UI for multi-page tasks like longer documents, PDFs or collections of images...
🔌 prodigy-pdf v0.3.0
Support multi-page PDFs in a single view...
uOttawa at LegalLens-2024: Transformer-based Classification…
Our training utilizes the spaCy pipeline configured with a transformer model and a transition-based parser for NER tasks. The deberta-v3-base model ha...
Distill Your LLMs and Surpass Their Performance
In her presentation at InfoQ Dev Summit, Ines Montani provided the audience with practical solutions for using the latest state-of-the-art models in r...
Serverless custom NLP with LLMs, Modal and Prodigy
In this blog post, we’ll show you how you can go from an idea and little data to a fully custom information extraction model using Prodigy and Modal,...