You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
## AWS DL1 EC2 Instance with Intel Gaudi Accelerators
10
+
11
+
This demo will showcase Large Language Model(LLM) inference using Intel Gaudi AI Accelerators. This module will install the base software required to run other examples.
12
+
13
+
## Usage
14
+
15
+
### variables.tf
16
+
17
+
Modify the region to target a specific AWS Region
18
+
19
+
```hcl
20
+
variable "region" {
21
+
description = "Target AWS region to deploy EC2 in."
22
+
type = string
23
+
default = "us-east-1"
24
+
}
25
+
```
26
+
27
+
### main.tf
28
+
29
+
Modify settings in this file to choose your AMI as well as other details around the instance that will be created. This demo was tested on Ubuntu 22.04.
Download and run the [Gen-AI-Gaudi-Demo](https://github.com/intel/terraform-intel-aws-vm/tree/main/examples/gen-ai-gaudi-base) Terraform Module by typing this command
Change into the `examples/gen-ai-gaudi-base` example folder
94
+
95
+
```Shell
96
+
cd terraform-intel-aws-vm/examples/gen-ai-gaudi-demo
97
+
```
98
+
99
+
Run the Terraform Commands below to deploy the demos.
100
+
101
+
```Shell
102
+
terraform init
103
+
terraform plan
104
+
terraform apply
105
+
```
106
+
107
+
After the Terraform module successfully creates the EC2 instance, **wait ~15 minutes** for the recipe to download/install the Intel Gaudi driver and software. After the deployment is done, you can launch the Habana Gaudi PyTorch container using the following:
description="The state of the instance. One of: `pending`, `running`, `shutting-down`, `terminated`, `stopping`, `stopped`"
18
+
value=try(module.ec2-vm.*.instance_state, "")
19
+
}
20
+
21
+
output"outpost_arn" {
22
+
description="The ARN of the Outpost the instance is assigned to"
23
+
value=try(module.ec2-vm.*.outpost_arn, "")
24
+
}
25
+
26
+
output"password_data" {
27
+
description="Base-64 encoded encrypted password data for the instance. Useful for getting the administrator password for instances running Microsoft Windows. This attribute is only exported if `get_password_data` is true"
28
+
value=try(module.ec2-vm.*.password_data, "")
29
+
}
30
+
31
+
output"primary_network_interface_id" {
32
+
description="The ID of the instance's primary network interface"
description="The private DNS name assigned to the instance. Can only be used inside the Amazon EC2, and only available if you've enabled DNS hostnames for your VPC"
38
+
value=try(module.ec2-vm.*.private_dns, "")
39
+
}
40
+
41
+
output"public_dns" {
42
+
description="The public DNS name assigned to the instance. For EC2-VPC, this is only available if you've enabled DNS hostnames for your VPC"
43
+
value=try(module.ec2-vm.*.public_dns, "")
44
+
}
45
+
46
+
output"public_ip" {
47
+
description="The public IP address assigned to the instance, if applicable. NOTE: If you are using an aws_eip with your instance, you should refer to the EIP's address directly and not use `public_ip` as this field will change after the EIP is attached"
48
+
value=try(module.ec2-vm.*.public_ip, "")
49
+
}
50
+
51
+
output"private_ip" {
52
+
description="The private IP address assigned to the instance."
53
+
value=try(module.ec2-vm.*.private_ip, "")
54
+
}
55
+
56
+
output"ipv6_addresses" {
57
+
description="The IPv6 address assigned to the instance, if applicable."
58
+
value=try(module.ec2-vm.*.ipv6_addresses, [])
59
+
}
60
+
61
+
output"tags_all" {
62
+
description="A map of tags assigned to the resource, including those inherited from the provider default_tags configuration block"
63
+
value=try(module.ec2-vm.*.tags_all, {})
64
+
}
65
+
66
+
output"spot_bid_status" {
67
+
description="The current bid status of the Spot Instance Request"
68
+
value=try(module.ec2-vm.*.spot_bid_status, "")
69
+
}
70
+
71
+
output"spot_request_state" {
72
+
description="The current request state of the Spot Instance Request"
73
+
value=try(module.ec2-vm.*.spot_request_state, "")
74
+
}
75
+
76
+
output"spot_instance_id" {
77
+
description="The Instance ID (if any) that is currently fulfilling the Spot Instance request"
0 commit comments