Improving Carbon Emissions of Federated Large Language Model Inference through Classification of Task-Specificity

Conference publication at HotCarbon 2024 - We present a paper to reduce the energy consumption of LLM inference by using specialized open source models selected by a classifier beforehand.