No DNN Left Behind: Improving Inference in the Cloud with Multi-Tenancy

Samanta, Amit; Shrinivasan, Suhas; Kaufmann, Antoine; Mace, Jonathan

doi:10.48550/arXiv.1901.06887

Samanta, A., Shrinivasan, S., Kaufmann, A., & Mace, J. (2019). No DNN Left Behind: Improving Inference in the Cloud with Multi-Tenancy. doi:10.48550/arXiv.1901.06887.

Item is Released

Basic data

Record permalink: https://hdl.handle.net/21.11116/0000-000A-7835-4
Version permalink: https://hdl.handle.net/21.11116/0000-000A-7836-3

Genre: Research paper

Files

arXiv:1901.06887.pdf (Preprint), 421KB

File permalink:
https://hdl.handle.net/21.11116/0000-000A-7837-2

Name:
arXiv:1901.06887.pdf

Description:
File downloaded from arXiv at 2022-05-17 10:49

OA status:

Visibility:
Public

MIME type / checksum:
application/pdf / [MD5]

Copyright date:
-

Copyright Info:
-

License:
http://arxiv.org/licenses/nonexclusive-distrib/1.0/

External references

Creators

Creators:
Samanta, Amit¹, Author
Shrinivasan, Suhas¹, Author
Kaufmann, Antoine², Author
Mace, Jonathan¹, Author

Affiliations:
¹ Group J. Mace, Max Planck Institute for Software Systems, Max Planck Society, ou_3031907
² Group P. Druschel, Max Planck Institute for Software Systems, Max Planck Society, ou_2105287

Content

Keywords: Computer Science; Distributed, Parallel, and Cluster Computing (cs.DC)

Abstract: With the rise of machine learning, inference on deep neural networks (DNNs)
has become a core building block on the critical path for many cloud
applications. Applications today rely on isolated ad-hoc deployments that force
users to compromise on consistent latency, elasticity, or cost-efficiency,
depending on workload characteristics. We propose to elevate DNN inference to
be a first-class cloud primitive provided by a shared multi-tenant system, akin
to cloud storage and cloud databases. A shared system enables cost-efficient
operation with consistent performance across the full spectrum of workloads. We
argue that DNN inference is an ideal candidate for a multi-tenant system
because of its narrow and well-defined interface and predictable resource
requirements.

Details

Language(s): eng - English

Date: Created: 2019-01-21; Modified: 2019-01-23; Published online: 2019

Publication status: Published online

Pages: 5 p.

Place, publisher, edition: -

Table of contents: -

Review type: -

Identifiers: arXiv: 1901.06887
URI: https://arxiv.org/abs/1901.06887
DOI: 10.48550/arXiv.1901.06887
BibTex Citekey: Samanta_1901.06887

Degree type: -
