JSC Inference Infrastructure
Alexandre Strube
February 26, 2025
A No-BS Database of How Companies Actually Deploy LLMs in Production: 300+ Technical Case Studies, Including Self-Hosted LLMs in https://www.zenml.io/llmops-database
āI think the complexity of Python package management holds down AI application development more than is widely appreciated. AI faces multiple bottlenecks ā we need more GPUs, better algorithms, cleaner data in large quantities. But when I look at the day-to-day work of application builders, thereās one additional bottleneck that I think is underappreciated: The time spent wrestling with version management is an inefficiency I hope we can reduce.ā
Andrew Ng, 28.02.2024
āBuilding on top of open source can mean hours wrestling with package dependencies, or sometimes even juggling multiple virtual environments or using multiple versions of Python in one application. This is annoying but manageable for experienced developers, but creates a lot of friction for new AI developers entering our field without a background in computer science or software engineering.ā
Andrew Ng, 28.02.2024
Inside config.json, add at the
"models"
section:
Try with the other models you got from the API!
Gitlab link to source code of the slides (needs JUDOOR account)
https://gitlab.jsc.fz-juelich.de/strube1/2024-12-talk-jsc-colloquium