USCIS AAO Dataset

Dataset explorer for schema metadata, statistics, and downloads.
Whole Set Cases
242
The full USCIS AAO set
Selected Set
Hard
28 cases ยท 11.57%
Content Scope
Schema + Stats
Raw cases are available through zip downloads

Choose Set

Switch between the generated stats views for whole, not-hard, and hard.
Whole
242 cases
100.00% of the whole set
Not-Hard
214 cases
88.43% of the whole set
Hard
28 cases
Selected
11.57% of the whole set

JSON Structure

7 keys
{
  "id": "uscis_xxxxxxxx",
  "text": "Case text",
  "statutes": "Statutes and legal standards text",
  "question": "Should this case be accepted or dismissed?",
  "label": "Accepted | Dismissed",
  "reference_prolog": "Reference Prolog program",
  "case_number": "APR092024_01E2309"
}

Field Metadata

Current Dataset
id string
Standard Value
Stable internal case identifier.
Description
Unique identifier for the case record in this dataset.
text string
Standard Value
Free-form case text.
Description
The case narrative text used as the case/facts content in this dataset.
statutes string
Standard Value
Free-form statutes and legal standards text.
Description
The statutes, legal standards, and supporting rule text paired with the case text.
question string
Standard Value
Should this case be accepted or dismissed?
Description
The target decision question associated with the case record.
label string
Standard Value
Accepted | Dismissed
Description
The decision label for the case.
reference_prolog string
Standard Value
Executable Prolog reference program text.
Description
The reference Prolog program associated with the case record.
case_number string
Standard Value
AAO case number string with embedded year.
Description
The published AAO case number used for case identification and year extraction.

Hard Stats

28 cases 11.57%
Outcome Labels
Accepted 50.00%
14
Dismissed 50.00%
14
Years
2022 39.29%
11
2023 21.43%
6
2024 39.29%
11
Year Total Accepted Dismissed
2022 11
2 18.18%
9 81.82%
2023 6
4 66.67%
2 33.33%
2024 11
8 72.73%
3 27.27%
Metric n Min P25 Median Mean P75 Max
Case Char Count 28 894 1,693 2,050 2,139.25 2,632.75 4,044
Case Paragraph Count 28 1 1 3 2.43 3 4
Case Sentence Count 28 5 8.75 10.50 10.39 12.25 18
Case Word Count 28 137 256.75 320 330.29 391.25 625
Statutes Char Count 28 571 966.25 1,630 1,581.11 1,997 3,228
Statutes Paragraph Count 28 1 1 2 1.71 2 3
Statutes Sentence Count 28 2 9 14 16.07 23.25 31
Statutes Word Count 28 89 167.50 246 250.89 311 544
Case + Statutes Char Count 28 1,708 3,174.25 3,718.50 3,722.36 4,429.50 5,913
Case + Statutes Paragraph Count 28 2 3 4 4.14 5 6
Case + Statutes Sentence Count 28 9 18.75 26 26.39 34 42
Case + Statutes Word Count 28 270 472 554.50 581.18 703.50 926