BharatCast logo

Collaborative Forecasting Platform

Built In India AI Weather Intelligence

BharatCast

BharatCast is being developed as a foundational AI model for weather forecasting and extreme-event prediction built in India, designed to serve the operational and scientific needs of the Indian forecasting ecosystem.

The project addresses a critical gap in weather AI: most state-of-the-art systems are trained and validated primarily for global benchmarks, but India needs models that are explicitly shaped around monsoon dynamics, heavy rainfall, cyclones, regional terrain, nowcasting-to-medium-range forecasting, and operational decision support. BharatCast aims to deliver an indigenous forecasting stack that can learn from Indian and global data, model atmospheric evolution across scales, support extreme-weather diagnostics, and eventually provide a complete AI-driven platform for forecast generation, evaluation, downscaling, and public-facing delivery.

Methodology

Incremental conditional diffusion for weather forecasting.

BharatCast predicts the next atmospheric update, not the full future state in one step. The current weather field is used as the conditioning signal, so the forecast remains anchored to the latest analyzed atmosphere.

During training, the target increment is perturbed with controlled stochastic noise. A neural network is then trained to infer how that noisy increment should be corrected so that it moves back toward a physically plausible weather evolution.

In this formulation, the score function is approximated by the network itself. It learns the local direction in which a candidate update should be adjusted across temperature, pressure, humidity, and wind so that the forecast increment becomes sharper, smoother, stronger, or weaker where needed.

At inference time, these learned denoising directions are followed step by step to refine the future increment, which is then added back to the latest state to produce the next forecast. This gives BharatCast a technically grounded way to model evolving weather while preserving continuity with the current atmosphere.

Platform Status

India-focused AI forecasting platform integrating benchmarking, modeling, downscaling, and operational delivery.

BharatCast brings together curated datasets, regional forecast modeling, downscaling pipelines, and deployment-oriented interfaces to support weather and monsoon intelligence for India.

Forecasting Scope India-focused AI Weather

Collaborative development across forecasting, diagnostics, and operational interface design.

Forecast Scope 6h to 7-day

Autoregressive nowcasting and medium-range rollouts over Indian and regional domains.

Scale Path 0.25° to 0.0625°

Multi-scale refinement from regional grids toward 12-6 km and localized extreme-event products.

Deployment Path MeitY GPU + Web APIs

Open-access visualization dashboards and APIs supporting the operational deployment core.

Adaptive Multi-Scale AI for India

Weather and monsoon extremes forecasting in India with an indigenous multi-institutional AI framework.

BharatCast benchmarks global AI weather systems, trains indigenous spatio-temporal models on Indian datasets, refines forecasts across scales, and supports early warning for heavy rainfall, cyclones, and hydro-meteorological extremes.

Program BharatCast Collaborative Initiative Current Focus Benchmarking, Modeling, Deployment
Core Benchmarking GraphCast, Pangu, GenCast measure bias and skill drop over Indian datasets
Primary Forecast Grid 0.25° India-focused AI regional and local-scale weather dynamics
Downscaling Goal 0.125° / 0.0625° diffusion and flow-based high-resolution refinement
Decision Support Heavy rainfall and cyclones improved lead time and regional risk awareness

How BharatCast Works

Compact overview of the current WeatherBench2 diffusion prototype

BharatCast currently uses a curated WeatherBench2 ERA5 subset, a patch-based conditional diffusion model, and global overlap-add stitching to generate the GT-vs-Pred videos shown below.

01

Data

WeatherBench2 subset

We start from a carefully selected WeatherBench2 ERA5 archive covering many years of global weather at regular 6-hour intervals.

  • It includes the main weather signals needed for forecasting, such as temperature, wind, pressure, humidity, and rainfall.
  • Upper-air information is retained across several standard pressure levels so the model can see how the atmosphere evolves vertically.
  • The result is a compact but meteorologically meaningful training set designed for efficient experimentation.
02

Formulation

Increment-based conditional diffusion

Instead of redrawing the entire future atmosphere from scratch, the model learns the next change that should happen from the current weather state.

  • Each training example asks the model to predict how the atmosphere should change over the next 24 hours.
  • The latest observed field acts as the anchor, so forecasts stay tied to the real atmospheric state.
  • This makes the prediction task more stable because the model focuses on what is changing, not on relearning what is already present.
03

Training

Patch cache and UNet setup

The global fields are broken into smaller overlapping regions so the model can learn detailed local structure without losing atmospheric context.

  • We train on a large collection of overlapping weather patches drawn from the global archive.
  • A multi-scale UNet learns how a noisy candidate update should be cleaned into a physically consistent weather change.
  • The current prototype is a large-capacity model, with roughly 99 million parameters, built to capture both broad circulation and local detail.
04

Inference + References

Global stitching and related systems

At forecast time, the model produces many local updates, and those updates are blended back together to recover a smooth full-globe prediction.

Primary Forecast View

2m Temperature Evolution

WeatherBench2 Global forecast media

Current Model Note

Current forecast media is produced from a patch-based diffusion model trained on a WeatherBench2 subset stored at data/interim/subsets/era5_1p5deg_6h_2000-2018_minvars.zarr. The training cache uses 64×64 patches with 32-cell stride, pred_len=4 corresponding to a +24h target at 6-hour resolution, and 30 flattened channels derived from 10 core variables / variable groups.

The saved BharatCast checkpoint contains approximately 99.1 million parameters with an EMA weight footprint of about 378 MB, while the full best.pt training checkpoint on disk is approximately 1.54 GB. The current training run configuration uses batch_size=8, lr=2e-4, and diffusion noise settings sigma_min=0.01, sigma_max=1.0.

This model run was trained on an NVIDIA RTX 6000 Ada Generation 48 GB class GPU setup for local experimentation and qualitative validation before larger-scale deployment on shared infrastructure.

Research Dashboard

Work packages

O1 Data Curation + Benchmarking Benchmark global AI weather models on Indian data NCMRWF, satellite, ground, and reanalysis harmonization
O2-O4 BharatCast Modeling Indigenous multi-scale AI with extremes sensitivity GNNs, FNOs, diffusion, ensembles, and downscaling modules
O5 Operational Deployment Open-access web portal with APIs and dashboards MeitY GPU integration for research, operations, and decision support

Key Diagnostics

Evidence package

RMSE ACC CRPS comparison
Prototype forecast-skill benchmarking
Spread skill coverage
Uncertainty and calibration diagnostics
Rank and PIT histograms
Reliability evidence for probabilistic outputs
Power spectrum and SSIM
Spatial fidelity and structure preservation

Media Lab

Prototype validation media

Operational Deployment Core

Current gaps before operationalization

  • Benchmark GraphCast, Pangu-Weather, and GenCast on Indian datasets with regional bias reporting
  • Validate BharatCast against NCMRWF data across monsoon and non-monsoon regimes
  • Integrate MeitY GPU-backed inference, data pipelines, and API delivery for open-access forecasts
  • Extend 0.25° forecasts toward 0.125° and 0.0625° downscaled products
  • Add extreme-event modules for heavy rainfall, cyclones, and flood-oriented risk indicators

Development Roadmap

Year-wise platform milestones

1

Year 1: curate harmonized NWP, reanalysis, satellite, and observational datasets and deliver validated baseline benchmarking reports.

2

Year 2: develop BharatCast forecasting, downscaling, and extremes modules into a functional AI system validated on NCMRWF data.

3

Year 3: deploy the end-to-end open web platform with APIs, dashboards, and operational forecast delivery on MeitY infrastructure.

Reference Resources

Datasets, and Operational Links

Curated benchmark datasets, India-focused data portals, and operational weather links relevant for training, validating, and deploying climate and weather AI systems.

Benchmark Dataset

WeatherBench 2

Open benchmark suite for data-driven global weather forecasting, widely used to compare deterministic and probabilistic forecast skill.

Storm Dataset

SEVIR

Large-scale severe-weather event dataset combining radar, satellite, lightning, and storm-event information for convective nowcasting.

Radar Benchmark

HKO-7 Radar Echo Dataset

Canonical precipitation nowcasting benchmark built from Hong Kong Observatory radar echo sequences for short-range spatio-temporal forecasting.

Global Reanalysis

Copernicus Climate Data Store

Official portal for ERA5 and related reanalysis, climate, and atmospheric datasets used for training, benchmarking, and verification workflows.

India Operational

IMD Open Data and Products

India Meteorological Department operational portal for forecasts, warnings, rainfall products, observations, and public weather bulletins.

India Data Portal

AIKosh

IndiaAI data repository for discoverable AI-ready datasets and future India-specific foundation-model training sources.

India Satellite Data

MOSDAC

Meteorological and Oceanographic Satellite Data Archival Centre for INSAT and related satellite products relevant for convection and monsoon analysis.

Operational Partner

NCMRWF

Operational forecasting partner for numerical weather prediction, diagnostics, forecast products, and validation workflows.

Research Papers

Foundation Models, Forecasting Papers & Platforms

Key papers and platform references spanning graph models, neural operators, diffusion models, and large-scale weather AI systems that inform the BharatCast prototype.

arXiv / NVIDIA

FourCastNet

Authors: Jaideep Pathak, Shashank Subramanian, Ashesh Chattopadhyay, et al.

High-resolution global weather model using adaptive Fourier neural operators.

Authors: Zongyi Li, Nikola Kovachki, Kamyar Azizzadenesheli, et al.

One of the core neural-operator papers underlying much of modern climate and weather surrogate modeling.

NVIDIA Platform

Earth-2

Authors: NVIDIA

Climate and weather digital-twin platform relevant as a systems-level reference for scalable AI weather infrastructure.

NVIDIA Tooling

Earth2Studio

Authors: NVIDIA

Tooling and workflows for Earth-2 and AI weather/climate experimentation.

Collaborative Project

Developed as a joint effort across academic institutes and operational forecasting partners.

IIT BHU logo
IIT BHU
NCMRWF logo
NCMRWF
IIT Roorkee logo
IIT Roorkee
IIT Ropar logo
IIT Ropar