Engine-Agnostic Model Hot-Swapping for Cost-Effective LLM Inference

AuthID
P-01A-NYX
6
Author(s)
Stoyanov, R · Spisaková, V · Reber, A · Armour, W · Copik, M
Document Type
Proceedings Paper
Year published
2025
Published
in PROCEEDINGS OF 2025 WORKSHOPS OF THE INTERNATIONAL CONFERENCE ON HIGH PERFORMANCE COMPUTING, NETWORK, STORAGE, AND ANALYSIS, SC25 WORKSHOPS
Pages: 114-125 (12)
Conference
2025 Workshops of the International Conference for High Performance Computing Networking Storage and Analysis-SC-W, Date: Nov 16-21, 2025, Location: St. Louis, MO
Indexing
Publication Identifiers
DBLP: conf/sc/StoyanovSRACB25
SCOPUS: 2-s2.0-105023407201
WoS: WOS:001661298800014
CORE Conference
No information about CORE Rank

Info
At this moment we don't have any links to full-text documents.