Article List - NewsStore

Machine Learning Sovit Ranjan R…

Introduction to Moondream3 and Tasks

In this article, we cover Moondream3, the latest iteration in the Moondream VLM family. We cover the model architecture and carry out inference using the...

1 month, 3 weeks ago • DebuggerCafe

351 words 1 min

Machine Learning Sovit Ranjan R…

DINOv3 with RetinaNet Head for Object Detection

In this article, we modify the DINOv3 backbone with a RetinaNet head for object detection. We train the model on the Pascal VOC dataset and carry out in...

1 month, 4 weeks ago • DebuggerCafe

348 words 1 min

Machine Learning Sovit Ranjan R…

Object Detection with DINOv3

In this article, we modify the DINOv3 model for object detection and train it on the Pascal VOC detection dataset. We discuss the model creation, trai...

2 months ago • DebuggerCafe

332 words 1 min

Machine Learning Sovit Ranjan R…

Semantic Segmentation with DINOv3

In this article, we convert the DINOv3 model for semantic segmentation and train it on the Pascal VOC segmentation dataset along with analysis of the...

2 months, 1 week ago • DebuggerCafe

360 words 1 min

Machine Learning Sovit Ranjan R…

Image Classification with DINOv3

In this article, we explore DINOv3 for image classification on a card image classification dataset. We cover the DINOv3 models, the model code, traini...

2 months, 2 weeks ago • DebuggerCafe

332 words 1 min

Machine Learning Sovit Ranjan R…

Training Gemma 3n for Transcription and Translation

In this article, we are training the Gemma 3n model for transcription and translation of German audio files to English using the Unsloth library and c...

2 months, 3 weeks ago • DebuggerCafe

379 words 1 min

Machine Learning Sovit Ranjan R…

Fine-Tuning Gemma 3n for Speech Transcription

In this article, we are fine-tuning Gemma 3n for German speech transcription using the Unsloth library and running evaluations before and after traini...

3 months ago • DebuggerCafe

344 words 1 min

Machine Learning Sovit Ranjan R…

Multimodal Gradio App with Together AI

In this article, we create a multimodal Gradio application with Together AI models for chatting with LLMs & VLMs, generating images, and automatic spe...

3 months, 1 week ago • DebuggerCafe

367 words 1 min

Machine Learning Sovit Ranjan R…

Serverless Inference with Together AI

In this article, we explore Together AI, a serverless generative AI platform for text generation, vision language models, image generation, and more....

3 months, 2 weeks ago • DebuggerCafe

326 words 1 min

Machine Learning Sovit Ranjan R…

Background Replacement Using BiRefNet

In this article, we create a background replacement application using BiRefNet. We cover the code using Jupyter Notebook and create a Gradio applicati...

3 months, 3 weeks ago • DebuggerCafe

336 words 1 min