USCIS AAO Dataset

Dataset explorer for schema metadata, statistics, and downloads.
Whole Set Cases
242
The full USCIS AAO set
Selected Set
Not-Hard
214 cases ยท 88.43%
Content Scope
Schema + Stats
Raw cases are available through zip downloads

Choose Set

Switch between the generated stats views for whole, not-hard, and hard.
Whole
242 cases
100.00% of the whole set
Not-Hard
214 cases
Selected
88.43% of the whole set
Hard
28 cases
11.57% of the whole set

JSON Structure

7 keys
{
  "id": "uscis_xxxxxxxx",
  "text": "Case text",
  "statutes": "Statutes and legal standards text",
  "question": "Should this case be accepted or dismissed?",
  "label": "Accepted | Dismissed",
  "reference_prolog": "Reference Prolog program",
  "case_number": "APR092024_01E2309"
}

Field Metadata

Current Dataset
id string
Standard Value
Stable internal case identifier.
Description
Unique identifier for the case record in this dataset.
text string
Standard Value
Free-form case text.
Description
The case narrative text used as the case/facts content in this dataset.
statutes string
Standard Value
Free-form statutes and legal standards text.
Description
The statutes, legal standards, and supporting rule text paired with the case text.
question string
Standard Value
Should this case be accepted or dismissed?
Description
The target decision question associated with the case record.
label string
Standard Value
Accepted | Dismissed
Description
The decision label for the case.
reference_prolog string
Standard Value
Executable Prolog reference program text.
Description
The reference Prolog program associated with the case record.
case_number string
Standard Value
AAO case number string with embedded year.
Description
The published AAO case number used for case identification and year extraction.

Not-Hard Stats

214 cases 88.43%
Outcome Labels
Accepted 50.00%
107
Dismissed 50.00%
107
Years
2023 6.07%
13
2024 82.71%
177
2025 11.21%
24
Year Total Accepted Dismissed
2023 13
13 100.00%
0 0.00%
2024 177
70 39.55%
107 60.45%
2025 24
24 100.00%
0 0.00%
Metric n Min P25 Median Mean P75 Max
Case Char Count 214 485 1,554 1,970.50 2,090.92 2,578.25 4,363
Case Paragraph Count 214 1 1 3 2.61 3 12
Case Sentence Count 214 3 8 11 10.83 13 26
Case Word Count 214 75 235 308 318.96 391 664
Statutes Char Count 214 402 1,323 1,827 1,961.51 2,331 6,574
Statutes Paragraph Count 214 1 2 2 1.87 2 4
Statutes Sentence Count 214 3 11.25 17 19.24 23 75
Statutes Word Count 214 61 205.25 275 306.52 357.25 1,060
Case + Statutes Char Count 214 1,333 3,051.75 3,943 4,054.43 4,784.75 10,185
Case + Statutes Paragraph Count 214 2 3 4 4.48 5 14
Case + Statutes Sentence Count 214 6 22 28.50 30.01 35 85
Case + Statutes Word Count 214 202 467.50 599 625.49 729.25 1,498