Dynamic Batching Engine for Deep Learning Serving. A tool that implements dynamic batching, flushing each batch when either the maximum batch size or the maximum latency is reached.
This tool is currently a proof of concept. Refer to MOSEC for production usage.
- Dynamic batching with control over batch size and latency
- Prevents invalid requests from affecting others in the same batch
- Communicates with workers through a Unix domain socket or TCP
- Supports load balancing
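The core policy behind the first bullet — flush a batch when either the size cap or the latency budget is hit — can be sketched as follows. This is a minimal Python illustration, not the service's actual Go implementation; `collect_batch` and its parameter names are hypothetical, mirroring the `-batch` and `-latency` flags.

```python
import queue
import time

def collect_batch(q, max_batch=32, max_latency=0.01):
    """Collect up to max_batch jobs from q, waiting at most
    max_latency seconds after the first job arrives.

    Hypothetical sketch of the batching policy, not the tool's code.
    """
    batch = [q.get()]                    # block until at least one job arrives
    deadline = time.monotonic() + max_latency
    while len(batch) < max_batch:
        remaining = deadline - time.monotonic()
        if remaining <= 0:               # latency budget exhausted
            break
        try:
            batch.append(q.get(timeout=remaining))
        except queue.Empty:              # no more jobs within the budget
            break
    return batch
```

A batch is therefore returned as soon as it is full, and a partially filled batch is returned once the latency budget after the first job expires, so a lone request never waits longer than `max_latency` for companions.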
```console
$ go run service/app.go --help
Usage of app:
  -address string
        socket file or host:port (default "batch.socket")
  -batch int
        max batch size (default 32)
  -capacity int
        max jobs in the queue (default 1024)
  -host string
        host address (default "0.0.0.0")
  -latency int
        max latency (millisecond) (default 10)
  -port int
        service port (default 8080)
  -protocol string
        unix or tcp (default "unix")
  -timeout int
        timeout for a job (millisecond) (default 5000)
```
```shell
# start the batching service
go run service/app.go
# start the Python inference worker
python examples/app.py
# send test requests
python examples/client.py
```
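For a quick smoke test without `examples/client.py`, a raw HTTP client along these lines can exercise the service on its default port. The snippet is a hedged sketch: it assumes the service accepts a plain HTTP POST body, and to stay runnable on its own it talks to a local echo stand-in server rather than the real service (`infer` and the `Echo` handler are hypothetical names, not part of this repo).

```python
import threading
from http.server import BaseHTTPRequestHandler, ThreadingHTTPServer
from urllib import request

def infer(payload: bytes, url: str) -> bytes:
    """POST a raw payload and return the raw response body."""
    req = request.Request(url, data=payload, method="POST")
    with request.urlopen(req, timeout=5) as resp:
        return resp.read()

class Echo(BaseHTTPRequestHandler):
    # Stand-in for the batching service: echoes the request body back.
    def do_POST(self):
        body = self.rfile.read(int(self.headers["Content-Length"]))
        self.send_response(200)
        self.send_header("Content-Length", str(len(body)))
        self.end_headers()
        self.wfile.write(body)

    def log_message(self, *args):  # silence per-request logging
        pass

server = ThreadingHTTPServer(("127.0.0.1", 0), Echo)
threading.Thread(target=server.serve_forever, daemon=True).start()
url = f"http://127.0.0.1:{server.server_address[1]}"

result = infer(b"hello", url)
print(result)  # echoed payload from the stand-in server
server.shutdown()
```

Against the real service, the same `infer` call would target `http://localhost:8080` (the `-host`/`-port` defaults above).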