...
Attributes | Description | Informational |
---|---|---|
Type | New Blueprint for AI Services at the Edge | |
Blueprint Family - Proposed Name | Network CloudAI Edge Blueprint Family | |
Use Case | Running Real-Time Generative AI Models at the Edge | |
Blueprint proposed Name | Serverless | |
Initial POD Cost (capex) | Leverage Unicycle POD - less than $150k | |
Scale & Type | Up to 7 servers x86/ARM server or deep edge class GPU Accelerators Memory Cache | |
Applications | Edge Servers Hosting Real-Time Generative AI Services, e.g. Speech-to-Speech Translation, LLM-based Virtual Assistant Services, Text-to-Image/Video Content Generation Services, etc. | |
Power Restrictions | Less than 10Kw | |
Infrastructure orchestration | Kubeless Docker 1.13.1 or above and K8s 1.10.2 or above- Container Orchestration OS - Ubuntu 16.x Under Cloud Orchestration - Airship v1.0 | |
SDN | OVS | |
Workload Type | Containers | |
Additional Details | Video Streaming: gRPC, Pytorch, Tensorflow Non-Sql Database: MongodB |
...