Picture for Marek Cygan

Marek Cygan

NoMagic.AI, Institute of Informatics, University of Warsaw

RoboMorph: Evolving Robot Morphology using Large Language Models

Add code
Jul 11, 2024
Viaarxiv icon

Bigger, Regularized, Optimistic: scaling for compute and sample-efficient continuous control

Add code
May 25, 2024
Viaarxiv icon

A Case for Validation Buffer in Pessimistic Actor-Critic

Add code
Mar 01, 2024
Viaarxiv icon

Overestimation, Overfitting, and Plasticity in Actor-Critic: the Bitter Lesson of Reinforcement Learning

Add code
Mar 01, 2024
Viaarxiv icon

Scaling Laws for Fine-Grained Mixture of Experts

Add code
Feb 12, 2024
Viaarxiv icon

Decoupled Actor-Critic

Add code
Oct 30, 2023
Viaarxiv icon

Mixture of Tokens: Efficient LLMs through Cross-Example Aggregation

Add code
Oct 24, 2023
Viaarxiv icon

Grasping Student: semi-supervised learning for robotic manipulation

Add code
Mar 08, 2023
Viaarxiv icon

Polite Teacher: Semi-Supervised Instance Segmentation with Mutual Learning and Pseudo-Label Thresholding

Add code
Nov 07, 2022
Figure 1 for Polite Teacher: Semi-Supervised Instance Segmentation with Mutual Learning and Pseudo-Label Thresholding
Figure 2 for Polite Teacher: Semi-Supervised Instance Segmentation with Mutual Learning and Pseudo-Label Thresholding
Figure 3 for Polite Teacher: Semi-Supervised Instance Segmentation with Mutual Learning and Pseudo-Label Thresholding
Figure 4 for Polite Teacher: Semi-Supervised Instance Segmentation with Mutual Learning and Pseudo-Label Thresholding
Viaarxiv icon

On All-Action Policy Gradients

Add code
Oct 24, 2022
Viaarxiv icon