pulumi-gcp-vertex-model-deployment

Pulumi custom provider to upload and deploy models to the Vertex platform.

Deploy model for batched prediction jobs

Upload model with a docker image to the model registry:

modelArtifactsURI := pulumi.Sprintf("gs://%s/models/my-model", myArtifactsBucket.Name)
modelImageURL := "us-central1-docker.pkg.dev/my-project/my-docker-registry/pytorch-cpu.1-12-bert-with-cpr@latest"

modelDeployment, err := vertexmodeldeployment.NewVertexModelDeployment(ctx, "model-for-batch-prediction",
  &vertexmodeldeployment.VertexModelDeploymentArgs{
    ProjectId:                      pulumi.String("my-project"),
    Region:                         pulumi.String("us-central1"),
    ModelArtifactsBucketUri:        modelArtifactsURI,
    ModelImageUrl:                  modelImageURL,
    ModelPredictionInputSchemaUri:  pulumi.Sprintf("%s/%s", modelArtifactsURI, v.ModelPredictionInputSchemaPath),
    ModelPredictionOutputSchemaUri: pulumi.Sprintf("%s/%s", modelArtifactsURI, v.ModelPredictionOutputSchemaPath),
    ServiceAccount:                 modelServiceAccount.Email,
    PredictRoute:                   pulumi.String("/predict"),
    HealthRoute:                    pulumi.String("/health"),
  },
  pulumi.Parent(v), pulumi.DependsOn(dependencies),
)
if err != nil {
  return fmt.Errorf("failed to deploy model /o\\: %w", err)
}

Returned model name should be set on the batch prediction job.

Deploy model to an endpoint

Upload and deploy model with a docker image to a given Vertex endpoint:

modelArtifactsURI := pulumi.Sprintf("gs://%s/models/my-model", myArtifactsBucket.Name)
modelImageURL := "us-docker.pkg.dev/vertex-ai/prediction/tf2-cpu.2-15:latest"

inputSchemaURI := pulumi.Sprintf("gs://%s/schemas/input_schema.yaml", myArtifactsBucket.Name)
outputSchemaURI := pulumi.Sprintf("gs://%s/schemas/output_schema.yaml", myArtifactsBucket.Name)

modelDeployment, err := vertexmodeldeployment.NewVertexModelDeployment(ctx, "model-for-endpoint", &vertexmodeldeployment.VertexModelDeploymentArgs{
  ProjectId:                       pulumi.String("my-project"),
  Region:                          pulumi.String("us-central1"),
  ModelArtifactsBucketUri:         modelArtifactsURI,
  ModelImageUrl:                   pulumi.String(modelImageURL),
  ModelPredictionInputSchemaUri:   inputSchemaURI,
  ModelPredictionOutputSchemaUri:  outputSchemaURI,
  ServiceAccount:                  modelServiceAccount.Email,

  // Optional deployment to an Endpoint. Not required for batched prediction jobs
  EndpointModelDeployment: &vertexmodeldeployment.EndpointModelDeploymentArgs{
    EndpointId:     myendpoint.Name,
    MachineType:    pulumi.String("g2-standard-8"),
    MinReplicas:    pulumi.Int(1),
    MaxReplicas:    pulumi.Int(3),
    TrafficPercent: pulumi.Int(100),
  },
}, pulumi.Parent(v))
if err != nil {
  return fmt.Errorf("failed to deploy model /o\\: %w", err)
}

See:

Getting Started

Install for local dev:

PROVIDER_VERSION=0.0.0 GOOS=darwin GOARCH=arm64 make plugin-local

Get the SDK:

go get github.com/davidmontoyago/pulumi-gcp-vertex-model-deployment/sdk/go

Name		Name	Last commit message	Last commit date
Latest commit History 44 Commits
.github		.github
cmd		cmd
pkg		pkg
sdk		sdk
.gitignore		.gitignore
.golangci.yml		.golangci.yml
LICENSE		LICENSE
Makefile		Makefile
PulumiPlugin.yaml		PulumiPlugin.yaml
README.md		README.md
go.mod		go.mod
go.sum		go.sum

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Repository files navigation

pulumi-gcp-vertex-model-deployment

Deploy model for batched prediction jobs

Deploy model to an endpoint

Getting Started

About

Uh oh!

Releases 6

Packages

Contributors 2

Languages

License

davidmontoyago/pulumi-gcp-vertex-model-deployment

Folders and files

Latest commit

History

Repository files navigation

pulumi-gcp-vertex-model-deployment

Deploy model for batched prediction jobs

Deploy model to an endpoint

Getting Started

About

Topics

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases 6

Packages 0

Contributors 2

Languages

Packages