Article List

Explore latest news, discover interesting content, and dive deep into topics that interest you

Computer Vision Research David Stutz

Thoughts and Lessons for Planning Rater Studies in AI

With the goal of deploying generative AI systems, rater studies are becoming increasingly common and important. This means more and more researchers a...

1 year ago Blog Archiv…
13261 words 44 min
Essential Best Practices for Image Labeling: A Complete Guide for Model Accuracy Open Source Data Science

Essential Best Practices for Image Labeling: A Complete Gui…

Learn about the essential best practices for image labeling that can help you improve your computer vision model accuracy....

1 year ago DagsHub Blog
25407 words 84 min
Security Research

A 2-ply minimax chess engine in 84,688 regular expressions

I wrote a (list of) regular expressions that will play a (not very good) chess game by running a 2-ply minimax search....

1 year ago Nicholas Ca…
97 words 1 min
Can LLMs write better code if you keep asking them to “write better code”? Data Science

Can LLMs write better code if you keep asking them to “writ…

Most coders want AI to write code faster: I want AI to write FASTER CODE....

1 year ago Posts on Ma…
80649 words 268 min
Machine Learning Research Journal Shulei Wang

Linear Separation Capacity of Self-Supervised Representatio…

Recent advances in self-supervised learning have highlighted the efficacy of data augmentation in learning data representation from unlabeled data. Tr...

1 year ago JMLR
1164 words 3 min
Machine Learning Research Journal Jiacai Liu, We…

On the Convergence of Projected Policy Gradient for Any Con…

Projected policy gradient (PPG) is a basic policy optimization method in reinforcement learning. Given access to exact policy evaluations, previous st...

1 year ago JMLR
931 words 3 min
Machine Learning Research Journal Erhan Bayrakta…

Learning with Linear Function Approximations in Mean-Field …

The paper focuses on mean-field type multi-agent control problems with finite state and action spaces where the dynamics and cost structures are symme...

1 year ago JMLR
1150 words 3 min
Machine Learning Research Journal Junwen Qiu, Xi…

A New Random Reshuffling Method for Nonsmooth Nonconvex Fin…

Random reshuffling techniques are prevalent in large-scale applications, such as training neural networks. While the convergence and acceleration effe...

1 year ago JMLR
1260 words 4 min
Machine Learning Research Journal Rohit Kanrar, …

Model-free Change-Point Detection Using AUC of a Classifier

In contemporary data analysis, it is increasingly common to work with non-stationary complex data sets. These data sets typically extend beyond the cl...

1 year ago JMLR
1253 words 4 min
Machine Learning Research Journal Ilyas Fatkhull…

EF21 with Bells & Whistles: Six Algorithmic Extensions of M…

First proposed by Seide (2014) as a heuristic, error feedback (EF) is a very popular mechanism for enforcing convergence of distributed gradient-based...

1 year ago JMLR
1209 words 4 min
Machine Learning Research Journal Xin Xu, Eibe F…

Multiple Instance Verification

We explore multiple instance verification, a problem setting in which a query instance is verified against a bag of target instances with heterogeneou...

1 year ago JMLR
1072 words 3 min
Machine Learning Research Journal Ye Tian, Yuqi …

Learning from Similar Linear Representations: Adaptivity, M…

Representation multi-task learning (MTL) has achieved tremendous success in practice. However, the theoretical understanding of these methods is still...

1 year ago JMLR
1332 words 4 min
Machine Learning Research Journal Yanxin Jin, Ya…

Exponential Family Graphical Models: Correlated Replicates …

Graphical models have been used extensively for modeling brain connectivity networks. However, unmeasured confounders and correlations among measureme...

1 year ago JMLR
1158 words 3 min
Machine Learning Research Journal Bernardo Ávila…

Optimizing Return Distributions with Distributional Dynamic…

We introduce distributional dynamic programming (DP) methods for optimizing statistical functionals of the return distribution, with standard reinforc...

1 year ago JMLR
1186 words 3 min
Machine Learning Research Journal Vanessa Kosoy

Imprecise Multi-Armed Bandits: Representing Irreducible Unc…

We introduce a novel multi-armed bandit framework, where each arm is associated with a fixed unknown credal set over the space of outcomes (which can...

1 year ago JMLR
718 words 2 min
Machine Learning Research Journal Etienne Boursi…

Early Alignment in Two-Layer Networks Training is a Two-Edg…

Training neural networks with first order optimisation methods is at the core of the empirical success of deep learning. The scale of initialisation i...

1 year ago JMLR
896 words 2 min
Machine Learning Research Journal Xianghua Zeng,…

Hierarchical Decision Making Based on Structural Informatio…

Hierarchical Reinforcement Learning (HRL) is a promising approach for managing task complexity across multiple levels of abstraction and accelerating...

1 year ago JMLR
1501 words 5 min
Machine Learning Research Journal Matias G. Delg…

Generative Adversarial Networks: Dynamics

We study quantitatively the overparametrization limit of the original Wasserstein-GAN algorithm. Effectively, we show that the algorithm is a stochast...

1 year ago JMLR
598 words 1 min
129 / 159