Amazon EKS Cluster w/ Istio

This example shows how to provision an EKS cluster with Istio.

  • Deploy an EKS cluster with one managed node group in a VPC
  • Add node_security_group rules for the port access required for Istio communication
  • Install Istio using Helm resources in Terraform
  • Install the Istio Ingress Gateway using Helm resources in Terraform; this step deploys a Service of type LoadBalancer that creates an AWS Network Load Balancer
  • Deploy/Validate Istio communication using a sample application

Refer to the documentation on Istio concepts.
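
The Istio components above are installed by Helm releases managed from Terraform. For orientation only, the rough CLI equivalent of what those releases install looks like the snippet below; the exact chart versions and values come from the pattern's Terraform code, and the release names match the helm list output shown later in this guide.

helm repo add istio https://istio-release.storage.googleapis.com/charts
helm repo update

# Control plane: base CRDs plus istiod in the istio-system namespace
helm install istio-base istio/base -n istio-system --create-namespace
helm install istiod istio/istiod -n istio-system --wait

# Ingress gateway: a Deployment plus a Service of type LoadBalancer (backed by an NLB)
helm install istio-ingress istio/gateway -n istio-ingress --create-namespace --wait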

Deploy

See here for the prerequisites and run the following commands to deploy this pattern.

terraform init
terraform apply --auto-approve
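
The kubectl commands in the rest of this guide assume your kubeconfig points at the new cluster. If it does not, update it first; the region and cluster name below are placeholders, so substitute the values used by your Terraform configuration.

aws eks update-kubeconfig --region us-west-2 --name <cluster-name>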

Once the resources have been provisioned, you will need to replace the istio-ingress pods due to an istiod dependency issue. Use the following command to perform a rolling restart of the istio-ingress pods:

kubectl rollout restart deployment istio-ingress -n istio-ingress
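
To confirm the restart completed, wait for the rollout to finish:

kubectl rollout status deployment/istio-ingress -n istio-ingress --timeout=120s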

Observability Add-ons

Use the following snippet to install the Istio observability add-ons on the EKS cluster once Istio has been deployed.

for ADDON in kiali jaeger prometheus grafana
do
    ADDON_URL="https://raw.githubusercontent.com/istio/istio/release-1.20/samples/addons/$ADDON.yaml"
    kubectl apply --server-side -f $ADDON_URL
done
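
The add-on workloads can take a minute to become ready. Optionally, wait for each Deployment to finish rolling out before moving on to validation; the Deployment names below match the pods listed in the next section.

for ADDON in kiali jaeger prometheus grafana
do
    kubectl rollout status deployment/$ADDON -n istio-system --timeout=120s
done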

Validate

  1. List out all the pods and services in the istio-system and istio-ingress namespaces:

    kubectl get pods,svc -n istio-system
    kubectl get pods,svc -n istio-ingress
    
    NAME                             READY   STATUS    RESTARTS   AGE
    pod/grafana-7d4f5589fb-4xj9m     1/1     Running   0          4m14s
    pod/istiod-ff577f8b8-c8ssk       1/1     Running   0          4m40s
    pod/jaeger-58c79c85cd-n7bkx      1/1     Running   0          4m14s
    pod/kiali-749d76d7bb-8kjg7       1/1     Running   0          4m14s
    pod/prometheus-5d5d6d6fc-s1txl   2/2     Running   0          4m15s
    
    NAME                       TYPE        CLUSTER-IP       EXTERNAL-IP   PORT(S)                                 AGE
    service/grafana            ClusterIP   172.20.141.12    <none>        3000/TCP                                4m14s
    service/istiod             ClusterIP   172.20.172.70    <none>        15010/TCP,15012/TCP,443/TCP,15014/TCP   4m40s
    service/jaeger-collector   ClusterIP   172.20.223.28    <none>        14268/TCP,14250/TCP,9411/TCP            4m15s
    service/kiali              ClusterIP   172.20.182.231   <none>        20001/TCP,9090/TCP                      4m15s
    service/prometheus         ClusterIP   172.20.89.64     <none>        9090/TCP                                4m14s
    service/tracing            ClusterIP   172.20.253.201   <none>        80/TCP,16685/TCP                        4m14s
    service/zipkin             ClusterIP   172.20.221.157   <none>        9411/TCP                                4m15s
    
    NAME                                 READY   STATUS    RESTARTS   AGE
    pod/istio-ingress-6f7c5dffd8-g1szr   1/1     Running   0          4m28s
    
    NAME                    TYPE           CLUSTER-IP      EXTERNAL-IP                                                                     PORT(S)                                      AGE
    service/istio-ingress   LoadBalancer   172.20.104.27   k8s-istioing-istioing-844c89b6c2-875b8c9a4b4e9365.elb.us-west-2.amazonaws.com   15021:32760/TCP,80:31496/TCP,443:32534/TCP   4m28s
    
  2. Verify all the Helm releases installed in the istio-system and istio-ingress namespaces:

    helm list -n istio-system
    
    NAME           NAMESPACE    REVISION UPDATED                              STATUS   CHART          APP VERSION
    istio-base    istio-system 1        2023-07-19 11:05:41.599921 -0700 PDT deployed base-1.18.1    1.18.1
    istiod        istio-system 1        2023-07-19 11:05:48.087616 -0700 PDT deployed istiod-1.18.1  1.18.1
    
    helm list -n istio-ingress
    
    NAME           NAMESPACE    REVISION UPDATED                              STATUS   CHART          APP VERSION
    istio-ingress istio-ingress 1        2023-07-19 11:06:03.41609 -0700 PDT  deployed gateway-1.18.1 1.18.1
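
    Optionally, if you have the istioctl CLI installed, you can also confirm that the data-plane proxies (the ingress gateway and any injected sidecars) are connected to and in sync with istiod:

    istioctl proxy-status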
    

Observability Add-ons

Validate the setup of the observability add-ons by running the following commands and accessing each service endpoint at a URL of the form http://localhost:<port>, where <port> is the local port used in the corresponding port-forward command.

# Visualize Istio Mesh console using Kiali
kubectl port-forward svc/kiali 20001:20001 -n istio-system

# Get to the Prometheus UI
kubectl port-forward svc/prometheus 9090:9090 -n istio-system

# Visualize metrics using Grafana
kubectl port-forward svc/grafana 3000:3000 -n istio-system

# Visualize application traces via Jaeger (the Jaeger UI is exposed through the tracing service)
kubectl port-forward svc/tracing 16686:80 -n istio-system
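
With the port-forwards above running (each in its own terminal), a quick smoke test of two of the endpoints can be done with curl; the paths below are the standard Prometheus readiness and Grafana health endpoints.

curl -s http://localhost:9090/-/ready      # Prometheus readiness endpoint
curl -s http://localhost:3000/api/health   # Grafana health endpoint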

Example

  1. Create the sample namespace and enable sidecar injection on it

    kubectl create namespace sample
    kubectl label namespace sample istio-injection=enabled
    
    namespace/sample created
    namespace/sample labeled
    
  2. Deploy helloworld app

    cat <<EOF > helloworld.yaml
    apiVersion: v1
    kind: Service
    metadata:
      name: helloworld
      labels:
        app: helloworld
        service: helloworld
    spec:
      ports:
      - port: 5000
        name: http
      selector:
        app: helloworld
    ---
    apiVersion: apps/v1
    kind: Deployment
    metadata:
      name: helloworld-v1
      labels:
        app: helloworld
        version: v1
    spec:
      replicas: 1
      selector:
        matchLabels:
          app: helloworld
          version: v1
      template:
        metadata:
          labels:
            app: helloworld
            version: v1
        spec:
          containers:
          - name: helloworld
            image: docker.io/istio/examples-helloworld-v1
            resources:
              requests:
                cpu: "100m"
            imagePullPolicy: IfNotPresent #Always
            ports:
            - containerPort: 5000
    EOF
    
    kubectl apply --server-side -f helloworld.yaml -n sample
    
    service/helloworld created
    deployment.apps/helloworld-v1 created
    
  3. Deploy the sleep app that we will use to connect to the helloworld app

    cat <<EOF > sleep.yaml
    apiVersion: v1
    kind: ServiceAccount
    metadata:
      name: sleep
    ---
    apiVersion: v1
    kind: Service
    metadata:
      name: sleep
      labels:
        app: sleep
        service: sleep
    spec:
      ports:
      - port: 80
        name: http
      selector:
        app: sleep
    ---
    apiVersion: apps/v1
    kind: Deployment
    metadata:
      name: sleep
    spec:
      replicas: 1
      selector:
        matchLabels:
          app: sleep
      template:
        metadata:
          labels:
            app: sleep
        spec:
          terminationGracePeriodSeconds: 0
          serviceAccountName: sleep
          containers:
          - name: sleep
            image: curlimages/curl
            command: ["/bin/sleep", "infinity"]
            imagePullPolicy: IfNotPresent
            volumeMounts:
            - mountPath: /etc/sleep/tls
              name: secret-volume
          volumes:
          - name: secret-volume
            secret:
              secretName: sleep-secret
              optional: true
    EOF
    
    kubectl apply --server-side -f sleep.yaml -n sample
    
    serviceaccount/sleep created
    service/sleep created
    deployment.apps/sleep created
    
  4. Check all the pods in the sample namespace

    kubectl get pods -n sample
    
    NAME                           READY   STATUS    RESTARTS   AGE
    helloworld-v1-b6c45f55-bx2xk   2/2     Running   0          50s
    sleep-9454cc476-p2zxr          2/2     Running   0          15s
    
  5. Connect to the helloworld app from the sleep app and verify that the connection goes through the Envoy sidecar proxy; an optional sketch after the output below shows how to expose the app through the Istio ingress gateway as well

    kubectl exec -n sample -c sleep \
        "$(kubectl get pod -n sample -l \
        app=sleep -o jsonpath='{.items[0].metadata.name}')" \
        -- curl -v helloworld.sample:5000/hello
    
    * processing: helloworld.sample:5000/hello
    ...
    * Connection #0 to host helloworld.sample left intact
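
    As an optional follow-up, you can also expose the helloworld app through the Istio ingress gateway deployed earlier and reach it via the Network Load Balancer. The sketch below assumes the gateway pods carry the default istio: ingress label applied by the Helm gateway chart for a release named istio-ingress; adjust the selector if your labels differ.

    cat <<EOF > helloworld-gateway.yaml
    apiVersion: networking.istio.io/v1beta1
    kind: Gateway
    metadata:
      name: helloworld-gateway
    spec:
      selector:
        istio: ingress   # assumption: default label set by the istio-ingress gateway chart release
      servers:
      - port:
          number: 80
          name: http
          protocol: HTTP
        hosts:
        - "*"
    ---
    apiVersion: networking.istio.io/v1beta1
    kind: VirtualService
    metadata:
      name: helloworld
    spec:
      hosts:
      - "*"
      gateways:
      - helloworld-gateway
      http:
      - match:
        - uri:
            exact: /hello
        route:
        - destination:
            host: helloworld
            port:
              number: 5000
    EOF

    kubectl apply --server-side -f helloworld-gateway.yaml -n sample

    # Resolve the NLB hostname from the istio-ingress Service and call the app through it
    ELB_HOSTNAME=$(kubectl get svc istio-ingress -n istio-ingress -o jsonpath='{.status.loadBalancer.ingress[0].hostname}')
    curl "http://$ELB_HOSTNAME/hello"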
    

Destroy

The AWS Load Balancer Controller add-on reconciles resource deletions asynchronously. During stack destruction, the Istio ingress resources and the load balancer controller add-on are deleted in quick succession, which prevents the removal of some of the AWS resources associated with the ingress gateway load balancer, such as the frontend and backend security groups. This causes the final terraform destroy -auto-approve command to time out and fail with VPC dependency errors like the one below:

│ Error: deleting EC2 VPC (vpc-XXXX): operation error EC2: DeleteVpc, https response error StatusCode: 400, RequestID: XXXXX-XXXX-XXXX-XXXX-XXXXXX, api error DependencyViolation: The vpc 'vpc-XXXX' has dependencies and cannot be deleted.

A possible workaround is to manually uninstall the istio-ingress Helm chart first:

terraform destroy -target='module.eks_blueprints_addons.helm_release.this["istio-ingress"]' -auto-approve
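
Before moving on, you can optionally confirm that the Service of type LoadBalancer, and with it the NLB and its security groups, has been removed. A quick check, assuming kubectl access to the cluster is still configured and using the region from your deployment:

kubectl get svc -n istio-ingress
aws elbv2 describe-load-balancers --region us-west-2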

Once the chart is uninstalled, move on to destroying the rest of the stack:

terraform destroy -target="module.eks_blueprints_addons" -auto-approve
terraform destroy -target="module.eks" -auto-approve
terraform destroy -auto-approve

See here for more details on cleaning up the resources created.