Competition Live

Competition Hub

The competition is underway! Download your data and review the problem statement below.

Check-in: Hilton Center for Business, 3rd Floor, Room 300

7:00 – 7:45 AM · Registration

Problem Statement

The California Ban on Prior Pay

Examine whether California's ban on salary history inquiries has impacted pay equity. Develop a model to test for pay discrimination and recommend whether a national ban should be implemented.

Download Problem Statement (.pdf)

Presentation Guidelines

Timeline

  • 8:15 AM Competition begins
  • 12:15 PM Submissions due
  • 1:30 PM Presentations begin
  • 10 min Presentation to judges
  • 5 min Q&A with judges

Judging Categories

  • Reasonableness of Results
    Can they explain why they got their numbers?
  • Analytical Rigor
    Did they explain why they chose their approach?
  • Presentation
    Could a non-technical executive follow this?
  • Innovation & Creativity
    Did they show me something I didn't expect?

Submit your presentation by 12:15 PM

Upload your slides using your team's unique submission link, available at lmudatathon.com/submissions/links.

Full Dataset

✓ Available

The complete NLS labor market dataset (194,545 rows) for your analysis.

Download Full CSV
Host: hopper.proxy.rlwy.net
Port: 37403
Database: railway
User: participant
Password: EvW27867KZfZgl1EH56vxQ
Table: graduate.nls_full

The connection details above point to a read-only PostgreSQL database. The sample table (graduate.nls_sample) remains available alongside the full table.
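A minimal sketch of one way to query the database from Python, assuming the psycopg2 driver is installed (pip install psycopg2-binary). The environment variable name NLS_DB_PASSWORD and the helper names build_dsn and preview are illustrative assumptions, not part of the competition materials. Set the variable to the password listed above rather than hard-coding it in a script.

```python
import os
from urllib.parse import quote

# Connection details copied from the listing above.
HOST = "hopper.proxy.rlwy.net"
PORT = 37403
DATABASE = "railway"
USER = "participant"

# Only the two tables named on this page are accepted by preview().
ALLOWED_TABLES = {"graduate.nls_sample", "graduate.nls_full"}


def build_dsn(password: str) -> str:
    """Return a libpq-style connection URI with the password URL-escaped."""
    return f"postgresql://{USER}:{quote(password, safe='')}@{HOST}:{PORT}/{DATABASE}"


def preview(table: str = "graduate.nls_sample", limit: int = 5):
    """Fetch up to `limit` rows and return (column_names, rows)."""
    if table not in ALLOWED_TABLES:
        raise ValueError(f"unknown table: {table}")
    import psycopg2  # imported lazily so build_dsn works without the driver

    # NLS_DB_PASSWORD is a hypothetical variable name chosen for this sketch.
    password = os.environ["NLS_DB_PASSWORD"]
    conn = psycopg2.connect(build_dsn(password))
    try:
        with conn.cursor() as cur:
            # The table name is checked against ALLOWED_TABLES above;
            # the row limit is passed as a bound parameter.
            cur.execute(f"SELECT * FROM {table} LIMIT %s", (limit,))
            columns = [desc[0] for desc in cur.description]
            return columns, cur.fetchall()
    finally:
        conn.close()
```

Calling preview() with no arguments reads from the 100-row sample table, which is a cheap way to confirm the connection works before pulling graduate.nls_full.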

Sample Dataset

A preview of the National Longitudinal Survey (NLS) labor market data (100 rows) to explore the structure.

Download Sample CSV
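One way to explore the structure of the sample file is sketched below using only the Python standard library. The helper name describe_csv and the column names in the demo are assumptions made for illustration; the real column names are defined in the data dictionary.

```python
import csv
import io
from collections import Counter


def describe_csv(handle):
    """Return (row_count, column_names, missing_counts) for an open CSV file."""
    reader = csv.DictReader(handle)
    missing = Counter()
    rows = 0
    for row in reader:
        rows += 1
        for name, value in row.items():
            if name is None:
                continue  # extra unnamed fields on a malformed row
            if value is None or value.strip() == "":
                missing[name] += 1
    return rows, list(reader.fieldnames or []), dict(missing)


# Tiny in-memory stand-in for the sample file (hypothetical columns).
demo = io.StringIO(
    "person_id,year,hourly_wage\n"
    "1,2017,18.50\n"
    "1,2019,\n"
    "2,2017,21.00\n"
)
rows, columns, missing = describe_csv(demo)
print(rows, columns, missing)
```

To run it on the downloaded file, open it with open(path, newline="") and pass the handle to describe_csv, where path is wherever you saved the sample CSV.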

Data Dictionary

Variable definitions for the NLS dataset including demographic codes, employment classifications, wage measures, and occupation/industry groupings.

Download Dictionary (.csv)

Domain Overview

You'll be working with labor economics data from the National Longitudinal Survey of Youth 1997 (NLSY97).

The NLSY97 is a nationally representative survey tracking 8,984 individuals born between 1980 and 1984, following them from adolescence into adulthood. The dataset captures employment histories, hourly wages, occupation and industry classifications, education levels, and demographic information across multiple survey rounds from 1997 to 2021.

Key concepts to familiarize yourself with: longitudinal panel data analysis, wage determination models, and state-level labor market regulations that aim to address pay equity.
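As a rough illustration of how a state-level regulation can be evaluated with panel data, the sketch below computes a difference-in-differences estimate of the gender gap in log wages: the change in the California gap after the ban, minus the change in the gap elsewhere. It is one possible starting point under stated assumptions, not the organizers' method. The field names log_wage, female, california, and post_ban are hypothetical, and the toy numbers are made up. The sketch ignores controls, survey weights, repeated observations per person, and standard errors, all of which a real analysis would need.

```python
from statistics import mean


def gender_gap(rows):
    """Mean log wage of men minus mean log wage of women."""
    men = [r["log_wage"] for r in rows if not r["female"]]
    women = [r["log_wage"] for r in rows if r["female"]]
    return mean(men) - mean(women)


def did_gap(rows):
    """Difference-in-differences of the gender gap: California vs. other states."""
    def cell(california, post_ban):
        subset = [
            r for r in rows
            if r["california"] == california and r["post_ban"] == post_ban
        ]
        return gender_gap(subset)

    ca_change = cell(True, True) - cell(True, False)
    other_change = cell(False, True) - cell(False, False)
    return ca_change - other_change


def obs(log_wage, female, california, post_ban):
    """Build one toy observation (hypothetical field names)."""
    return {
        "log_wage": log_wage,
        "female": female,
        "california": california,
        "post_ban": post_ban,
    }


# Made-up data: one man and one woman in each state/period cell.
toy = [
    obs(3.0, False, True, False), obs(2.8, True, True, False),
    obs(3.1, False, True, True), obs(3.0, True, True, True),
    obs(2.9, False, False, False), obs(2.7, True, False, False),
    obs(3.0, False, False, True), obs(2.8, True, False, True),
]
estimate = did_gap(toy)
print(round(estimate, 3))
```

A negative estimate means the gap narrowed more in California than in the comparison states over the same period. Whether that difference can be attributed to the ban depends on the comparison states following a similar trend beforehand, which should be checked rather than assumed.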