To measure progress in machine common sense, we're developing a suite of benchmark datasets.
Knowledge graphs provide useful semi-structured representations of commonsense. Currently, we're developing and exploring how to better use such graphs with current models.
Visual Commonsense Reasoning (VCR) is a new task and large-scale dataset for cognition-level visual understanding.
SWAG (Situations With Adversarial Generations) is a large-scale dataset for the task of grounded commonsense inference, unifying natural language inference and physically grounded reasoning.
Winogrande is a large-scale Winograd Schema Challenge dataset, adjusted to improve both the scale and the hardness of the dataset.