Article List

Explore latest news, discover interesting content, and dive deep into topics that interest you

Clear Filters
Machine Learning Sovit Ranjan R…

Introduction to Moondream3 and Tasks

In this article, we cover Moondream3, the latest iteration in Moondream VLM family. We cover the model architecture and carry out inference using the...

1 month, 3 weeks ago DebuggerCafe
351 words 1 min
Machine Learning Sovit Ranjan R…

DINOv3 with RetinaNet Head for Object Detection

In this article, we modify the DINOv3 backbone with RetinaNet head for object detection. We train the model on the Pascal VOC dataset and carry out in...

1 month, 4 weeks ago DebuggerCafe
348 words 1 min
Machine Learning Sovit Ranjan R…

Object Detection with DINOv3

In this article, we modify the DINOv3 model for object detection and train in on the Pascal VOC detection dataset. We discuss the model creation, trai...

2 months ago DebuggerCafe
332 words 1 min
Machine Learning Sovit Ranjan R…

Semantic Segmentation with DINOv3

In this article, we convert the DINOv3 model for semantic segmentation and train it on the Pascal VOC segmentation dataset along with analysis of the...

2 months, 1 week ago DebuggerCafe
360 words 1 min
Machine Learning Sovit Ranjan R…

Image Classification with DINOv3

In this article, we explore DINOv3 for image classification on a card image classification dataset. We cover the DINOv3 models, the model code, traini...

2 months, 2 weeks ago DebuggerCafe
332 words 1 min
Machine Learning Sovit Ranjan R…

Training Gemma 3n for Transcription and Translation

In this article, we are training the Gemma 3n model for transcription and translation of German audio files to English using the Unsloth library and c...

2 months, 3 weeks ago DebuggerCafe
379 words 1 min
Machine Learning Sovit Ranjan R…

Fine-Tuning Gemma 3n for Speech Transcription

In this article, we are fine-tuning Gemma 3n for German speech transcription using the Unsloth library and running evaluations before and after traini...

3 months ago DebuggerCafe
344 words 1 min
Machine Learning Sovit Ranjan R…

Multimodal Gradio App with Together AI

In this article, we create a multimodal Gradio application with Together AI models for chatting LLMs & VLMs, generating images, and automatic spe...

3 months, 1 week ago DebuggerCafe
367 words 1 min
Machine Learning Sovit Ranjan R…

Serverless Inference with Together AI

In this article, we explore Together AI, a serverless generative AI platform for text generation, vision language models, image generation, and more....

3 months, 2 weeks ago DebuggerCafe
326 words 1 min
Machine Learning Sovit Ranjan R…

Background Replacement Using BiRefNet

In this article, we create a background replacement application using BiRefNet. We cover the code using Jupyter Notebook and create a Gradio applicati...

3 months, 3 weeks ago DebuggerCafe
336 words 1 min