Version: 最新版本(unreleased)

AthenaServing Framework (ASF)

Vision

In the whole field, AI capabilities can be rapidly implemented in production applications, and AI models and services can be reached at your fingertips; let ASF become the de facto standard of AI inference service framework.

What is ASF?

It is a service-free, fully managed A.I. engine service framework designed for A.I. algorithm engineers. Algorithm engineers can quickly realize A.I. engine cloud service by integrating the Language Wrapper provided in ASF, without paying attention to the development and operation and maintenance of the underlying infrastructure and service, and can deploy the engine efficiently, safely, autonomously and controllably. Upgrade, scale, monitor and operate.

Relying on iFLYTEK's many years of experience in the production of voice technology as a service, a set of K8S-based service-oriented frameworks focusing on AI engineering and general AI capabilities. It is planned to expand a set of capabilities for managing AI services to k8s based on the service discovery mechanism and CRD mechanism of k8s.

The main features are:

Model post-processing
Model inference service
Model service containerization
Model service governance (service discovery, scaling)
Model service dynamic load balancing
Model service one-click deployment of private cloud
Model service one-click deployment of public cloud ASE
Model service protocol standardization

What is AIGES ?

AIGES is one of the core components of ASF, implemented by golang. It provides a unified standard Wrapper interface for user-mode inference code, currently supports Python/C++, and theoretically supports any language plugin (not yet supported)

Scenario-oriented

The implementation of AI service capabilities by SMEs lacks unified management and implementation plans. Every time a user adds a new AI capability, he needs to go through steps such as encapsulating an engine. Because the encapsulation engine does not have a unified standard and the business logic is complex, it is not easy for users to maintain and refactor.

Solve the problem

1: The landing process of the research side model is too long and it is not easy to iterate 2: There is no unified standard for AI engine side packaging

Overall Architecture (v2)

Workflow

Features

☑ Support model inference into RPC service (Serving framework will be converted into HTTP service)

☑ Support c++/c code infer

☑ Support python code infer

☑ Support configuration center, service discovery

☑ Support three-party API forwarding

Framework code repository

Modules	Repository	Status
☑ loader	loader	Open source
☑ lb_client	Load Balancer Load aggregation component	Open source
☑ WebGate	WebGate Web gateway component	Open source
☑ Atom	Atom Protocol conversion component	Open source
☑ Polaris	Polaris Configuration Center and Service Discovery	Open Source
☑ Helm	[athena_deploy]https://github.com/xfyun/athena_deploy	Open source
☐ Docker Compose	Serving on Docker with docker-compose one-click deployment	To be supported
☐ Documentation	website	In Progress
☑ Protocol	AI Capability Protocol Specification	Open source
☐ AseCTl command line tool	Asectl command line tool	To be open source
☐ Python Debugging Toolkit	AigesKitpython toolkit	In progress

AthenaServing Framework (ASF)

Vision​

What is ASF?​

What is AIGES ?​

Scenario-oriented​

Solve the problem​

Overall Architecture (v2)​

Workflow​

Features​

Framework code repository​