Skip to content
Docs

SambaNova LLM Provider

The SambaNova provider connects Beluga AI to SambaNova’s inference platform, which provides high-throughput inference on custom RDU (Reconfigurable Dataflow Unit) hardware. SambaNova exposes an OpenAI-compatible API, so this provider supports all standard features including streaming, tool calling, and structured output.

Choose SambaNova when you need high-throughput batch inference or consistent low-latency serving of large open-source models. SambaNova’s custom RDU hardware is optimized for sustained throughput, making it well-suited for production workloads with predictable high-volume traffic.

Terminal window
go get github.com/lookatitude/beluga-ai/llm/providers/sambanova
FieldRequiredDefaultDescription
ModelYesModel ID (e.g. "Meta-Llama-3.3-70B-Instruct")
APIKeyYesSambaNova API key (sn-...)
BaseURLNohttps://api.sambanova.ai/v1Override API endpoint
TimeoutNo30sRequest timeout

Environment variables:

VariableMaps to
SAMBANOVA_API_KEYAPIKey
package main
import (
"context"
"fmt"
"log"
"os"
"github.com/lookatitude/beluga-ai/config"
"github.com/lookatitude/beluga-ai/llm"
"github.com/lookatitude/beluga-ai/schema"
_ "github.com/lookatitude/beluga-ai/llm/providers/sambanova"
)
func main() {
model, err := llm.New("sambanova", config.ProviderConfig{
Model: "Meta-Llama-3.3-70B-Instruct",
APIKey: os.Getenv("SAMBANOVA_API_KEY"),
})
if err != nil {
log.Fatal(err)
}
msgs := []schema.Message{
schema.NewSystemMessage("You are a helpful assistant."),
schema.NewHumanMessage("What is the capital of France?"),
}
resp, err := model.Generate(context.Background(), msgs)
if err != nil {
log.Fatal(err)
}
fmt.Println(resp.Text())
}
for chunk, err := range model.Stream(context.Background(), msgs) {
if err != nil {
log.Fatal(err)
}
fmt.Print(chunk.Delta)
}
fmt.Println()
modelWithTools := model.BindTools(tools)
resp, err := modelWithTools.Generate(ctx, msgs, llm.WithToolChoice(llm.ToolChoiceAuto))
resp, err := model.Generate(ctx, msgs,
llm.WithResponseFormat(llm.ResponseFormat{Type: "json_object"}),
)
resp, err := model.Generate(ctx, msgs,
llm.WithTemperature(0.7),
llm.WithMaxTokens(2048),
llm.WithTopP(0.9),
llm.WithStopSequences("END"),
)
resp, err := model.Generate(ctx, msgs)
if err != nil {
log.Fatal(err)
}
import "github.com/lookatitude/beluga-ai/llm/providers/sambanova"
model, err := sambanova.New(config.ProviderConfig{
Model: "Meta-Llama-3.3-70B-Instruct",
APIKey: os.Getenv("SAMBANOVA_API_KEY"),
})
Model IDDescription
Meta-Llama-3.3-70B-InstructLlama 3.3 70B — best quality/speed tradeoff
Meta-Llama-3.1-8B-InstructLlama 3.1 8B — fastest, lowest cost
Meta-Llama-3.1-405B-InstructLlama 3.1 405B — most capable, highest throughput

Refer to SambaNova’s documentation for the latest model list.