[CVPR 2026] FAKER-Air: Real-Time Long Horizon Air Quality Forecasting via Group-Relative Policy Optimization

FAKER-Air (GRPO)

Model Description

This repository provides the GRPO-trained checkpoint of FAKER-Air, a regional air-quality forecasting model for East Asia.

FAKER-Air (Forecast Alignment via Knowledge-guided Expected-Reward) is built on an Aurora-based 3D encoder-decoder and trained in two stages:

Supervised Fine-Tuning (SFT) with Temporal Accumulation Loss to reduce exposure bias during autoregressive rollout.
Group-Relative Policy Optimization (GRPO) with AQI-aware rewards and curriculum rollout to improve operational reliability, especially by reducing false alarms.

The model is designed for long-horizon particulate matter forecasting up to 120 hours (5 days) ahead using real-time observations and regional CMAQ reanalysis for East Asia.

Intended Use

This model is intended for:

Research on long-horizon air-quality forecasting
Spatiotemporal forecasting over East Asia
Reliability-aware forecasting and AQI-oriented evaluation
Offline experiments and prototype operational systems

Primary Outputs

The model is mainly used to forecast:

PM2.5
PM10

These continuous predictions can also be mapped to AQI-style discrete categories such as Good, Moderate, Bad, and VeryBad.

Out-of-Scope Use

This model is not intended for:

Direct public warning issuance without local validation and expert review
Deployment outside the East Asia domain used in the paper
Use with input data that does not match the official preprocessing and directory structure
Clinical, legal, or regulatory decision-making without independent verification

Training Data

FAKER-Air is trained on the CMAQ-OBS Regional Air Quality Dataset for East Asia.

The released dataset combines:

Observation fields under data/obs
CMAQ reanalysis fields under data/cmaq

The dataset is built for East Asia at 27 km spatial resolution and 6-hour temporal intervals, combining grid-aligned observation fields with region-specific CMAQ reanalysis.

Model Inputs and Outputs

Inputs

The official pipeline expects preprocessed inputs matching the repository format.

In practice, inference uses:

Gridded OBS inputs (.npz)
CMAQ concentration fields (*_x_conc.npy)
CMAQ 2D meteorological fields (*_x_metcro2d.npy)
CMAQ 3D meteorological fields (*_x_metcro3d.npy)

Please follow the preprocessing and directory structure in the official repository before running evaluation or inference.

Outputs

The model predicts long-horizon particulate matter fields over the East Asia forecasting domain, especially:

PM2.5 concentration fields
PM10 concentration fields

These outputs can be further converted into AQI-style classes for operational analysis.

Training Procedure

The official training recipe uses a two-stage setup:

Stage 1: SFT with Temporal Accumulation Loss
Stage 2: GRPO with class-wise AQI rewards and curriculum rollout

This design is intended to combine strong spatiotemporal forecasting performance with improved operational reliability.

Evaluation

The paper evaluates long-horizon forecasting up to 120 hours ahead.

Selected Reported Results for PM2.5 at 120h

Binary F1: 56.72
False Alarm Rate (FAR): 17.32
4-class Accuracy: 45.16
4-class F1-macro: 41.90

Compared with the SFT baseline, the GRPO model is reported to reduce PM2.5 FAR from 32.86 to 17.32, while maintaining competitive F1.
For PM10, the paper reports a reduction in FAR from 18.44 to 10.81 in the 120h setting.

For full results, please refer to the paper and supplementary material.

Limitations

This checkpoint is region-specific and is designed for East Asia, not for global forecasting.
Performance depends on using the same preprocessing, variables, grid definition, and rollout setup as the official codebase.
The model is optimized for reliability-aware forecasting, so some operating points may trade raw point-wise accuracy for lower false alarm rates.
Performance may degrade under severe domain shift, missing local inputs, or deployment in regions outside the training domain.

How to Use

1) Download the checkpoint

from huggingface_hub import hf_hub_download

ckpt_path = hf_hub_download(
    repo_id="2na-97/FAKER-Air",
    filename="FAKER-Air_GRPO",  # change this if you rename the checkpoint file
)

print(ckpt_path)

2) Prepare the dataset

Dataset:

2na-97/FAKER-Air

Code:

https://github.com/kaist-cvml/FAKER-Air/tree/main

Follow the repository instructions to place OBS and CMAQ files in the expected directories.

3) Run rollout evaluation / inference

CUDA_VISIBLE_DEVICES=0 \
torchrun \
  --nproc_per_node=1 \
  --master_addr="127.0.0.1" \
  --master_port=29502 \
  test.py --batch 1 \
  --model aurora \
  --test-start-date 2023-01-01 \
  --test-end-date 2023-12-31 \
  --data-sources obs,cmaq \
  --checkpoint-path /path/to/FAKER-Air_GRPO \
  --npz-path ./data/obs_npz_27km \
  --cmaq-root ./data/cmaq_only_npy \
  --mode rollout \
  --rollout-hours 120 \
  --use_cmaq_pm_only

Repository Links

Code: https://github.com/kaist-cvml/FAKER-Air/tree/main
Dataset: https://huggingface.co/datasets/2na-97/FAKER-Air
Model: https://huggingface.co/2na-97/FAKER-Air

Citation

@article{kang2026fakerair,
  title={Real-Time Long Horizon Air Quality Forecasting via Group-Relative Policy Optimization},
  author={Kang, Inha and Kim, Eunki and Ryu, Wonjeong and Shin, Jaeyo and Yu, Seungjun and Kang, Yoon-Hee and Jeong, Seongeun and Kim, Eunhye and Kim, Soontae and Shim, Hyunjung},
  journal={arXiv preprint arXiv:2511.22169},
  year={2026}
}
``

Downloads last month: -; Downloads are not tracked for this model. How to track

Inference Providers NEW

Time Series Forecasting

This model isn't deployed by any Inference Provider. 🙋 Ask for provider support

Dataset used to train 2na-97/FAKER-Air

Paper for 2na-97/FAKER-Air

Real-Time Long Horizon Air Quality Forecasting via Group-Relative Policy Optimization

Paper • 2511.22169 • Published Nov 27, 2025 • 1