Explanation¶
Background material that helps you understand why Rasteret works the way it does.
Practical workflow lens:
build/load/as_collection -> subset/where -> get_xarray/get_numpy/get_gdf/sample_points.
- Architecture: component overview and data flow
- Design Decisions: why Parquet, why a custom reader, why COGs stay COGs, and why these choices enable an open community
- Correctness Contract: the user-visible guarantees Rasteret makes (and the contract contributors must preserve)
- Schema Contract: column guarantees for contributors building ingest drivers
- Ecosystem Comparison: how Rasteret relates to TorchGeo, xarray, rasterio, and similar tools
- Benchmarks: apples-to-apples performance comparison
- TorchGeo Benchmark: Rasteret vs Native Rasterio: interactive notebook - Rasteret vs TorchGeo native, side-by-side
Every design choice here (Parquet indexes, open catalog descriptors, decoupled storage backends) is made to keep Rasteret interoperable and community-driven. The catalog format is intentionally spec-aligned and evolving toward a portable standard shareable across tools, not just Rasteret.
These are deliberate trade-offs, not accidental ones. If you have ideas, see something that could work differently, or want to discuss the reasoning, open a Discussion. The design benefits from more perspectives.