Articles producció científicaEnginyeria Informàtica i Matemàtiques

GeoServe: Leveraging Disaggregated Data Processing for Scalable Geospatial Model Serving

  • Identification data

    Identifier:  imarina:9586094
    Authors:  Gerard Finol; Christian Pinto
    Abstract:
    Geospatial foundation models (GFMs) operate on large, multi-band raster products (e.g., GeoTIFF) that require expensive data access and preprocessing - reprojection, decoding, normalization, and tiling - before GPU inference. In our measurements, reading and preprocessing geospatial inputs can be orders of magnitude slower than tokenization or standard image preprocessing, and constitute 31 - 43% of end-to-end request time for a representative GFM. Existing inference frameworks such as vLLM execute this preprocessing inline with request handling, which under load serializes CPU and I/O work, increasing queueing delay, and leaving GPUs underutilized. We present GeoServe, a Ray-based serving system that decouples the geospatial data pipeline from GPU inference by disaggregating I/O- and CPU-heavy preprocessing to a scalable pool of CPU workers, while keeping GPU nodes dedicated to model forward passes. We show experimentally that GeoServe reduces the p90 request latency by up to 414.9× at high load and improves throughput by up to 4.74× compared to vanilla vLLM, while increasing the achieved model forward-pass rate from ~ 16 inf./sec to ~ 72 inf./sec via better batching opportunities.
  • Others:

    Link to the original source: https://dl.acm.org/doi/10.1145/3805621.3807611
    Funding program action: Optimización inteligente del Análisis de datos extremos (X-AI)
    APA: Gerard Finol; Christian Pinto (2026). GeoServe: Leveraging Disaggregated Data Processing for Scalable Geospatial Model Serving.
    Paper original source: Proceedings of the Sixth European Workshop on Machine Learning and Systems. 246-253
    Program founding action 2: Cloud open source research mobility network
    Article's DOI: 10.1145/3805621.3807611
    Funding program: Pla Nacional, Projectes RDI del Ministerio de Ciencia, Innovación y Universidades
    Journal publication year: 2026-04-28
    Entity: Universitat Rovira i Virgili
    Paper version: info:eu-repo/semantics/publishedVersion
    Record's date: 2026-06-13
    URV's Author/s: Finol Peñalver, Gerard
    Project code 2: 101086248
    Department: Enginyeria Informàtica i Matemàtiques
    Licence document URL: https://repositori.urv.cat/ca/proteccio-de-dades/
    Publication Type: Proceedings Paper
    Founding program 2: Horizon Europe - MSCA Staff Exchanges 2021
    Author, as appears in the article.: Gerard Finol; Christian Pinto
    Project code: PID2023-148202OB-C21
    licence for use: https://creativecommons.org/licenses/by/3.0/es/
    Author's mail: gerard.finol@urv.cat
    Acronym 2: CLOUDSTARS