UDMI Pubber Swarm Cluster

The ‘pubber swarm’ capability relies on four major components:

An IoT Core registry configured as per a specific site_model
A k8s cluster running a swarm of pubber nodes that simulate IoT Core client devices
A Cloud Run function, triggered by cron, that manages the swarm worker pool
A glue PubSub topic and subscription that distribute work to the nodes

Set project location if/as necessary for your organization
These docs assume the GCP_PROJECT environment variable is set appropriately, e.g. export GCP_PROJECT=udmi-swarm-example
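Since every later command takes ${GCP_PROJECT} as an argument, it can help to fail early when the variable is missing. A minimal sketch; the helper name require_gcp_project is hypothetical and not part of the udmi tooling:

```shell
# Hypothetical helper (not part of udmi): stop early when GCP_PROJECT is
# unset or empty, instead of passing an empty argument to later commands.
require_gcp_project() {
  if [ -z "${GCP_PROJECT:-}" ]; then
    echo "GCP_PROJECT is not set, e.g. export GCP_PROJECT=udmi-swarm-example" >&2
    return 1
  fi
  echo "Using project: ${GCP_PROJECT}"
}
```

It could be chained in front of any of the commands on this page, for example require_gcp_project && bin/registrar sites/zz-top-example ${GCP_PROJECT}.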

Enable API
Get a site_model repo (e.g. zz-top-example); it should end up in udmi/sites/{site_name}/
Create site registry
- Registry ID and region are defined in the {site_name}/cloud_iot_config.json file.
- The topic should be the one for received device data (e.g. data-target), not the swarm-feed topic.
Run the udmi registrar tool: udmi$ bin/registrar sites/zz-top-example ${GCP_PROJECT}

Enable API
Choose a “GKE Standard” cluster (not “Autopilot”)
Name appropriately (e.g. “pubber-swarm”)
Enable Pub/Sub Access Scope
- NODE POOLS > default-pool > security
- Access scope “set access for each API”
  - Set “Cloud Pub/Sub” to “Enabled”
  - Possibly also set “Cloud Platform” to “Enabled” (not sure if this is required)
Create cluster
Get kubectl certs: gcloud --project=${GCP_PROJECT} container clusters --zone=us-central1-c get-credentials pubber-swarm
You may also need to enable access to the container registry (I did this, but am not sure it’s required): gsutil iam ch serviceAccount:943284686802-compute@developer.gserviceaccount.com:roles/storage.objectViewer gs://us.artifacts.udmi-swarm-example.appspot.com/

udmi$ bin/deploy ${GCP_PROJECT}

This will:

Create a new topic called swarm-feed
Create a simple subscription also called swarm-feed
Set the message retention on the subscription to 10 minutes (minimum)
Add the default compute engine service account (something like XXXXXXXXX-compute@developer.gserviceaccount.com) as a “PubSub Subscriber” to the subscription.
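For reference, the topic, subscription, retention, and subscriber binding listed above correspond roughly to the gcloud commands below. This is a sketch under assumptions, not the project's own script: the run and create_swarm_feed helpers are hypothetical, and the service account e-mail must be supplied by you since it differs per project. With DRY_RUN left at its default the commands are only printed.

```shell
# Print each command instead of executing it unless DRY_RUN=0 is set.
run() {
  if [ "${DRY_RUN:-1}" = "1" ]; then
    echo "+ $*"
  else
    "$@"
  fi
}

# Hypothetical helper. Arguments: project id, then the full e-mail of the
# default compute engine service account (deliberately not filled in here).
create_swarm_feed() {
  project="$1"
  service_account="$2"
  # Topic and subscription share the name swarm-feed.
  run gcloud pubsub topics create swarm-feed --project="${project}"
  # 10m is the minimum message retention for a subscription.
  run gcloud pubsub subscriptions create swarm-feed --project="${project}" \
    --topic=swarm-feed --message-retention-duration=10m
  # Grant the service account the Pub/Sub Subscriber role on the subscription.
  run gcloud pubsub subscriptions add-iam-policy-binding swarm-feed \
    --project="${project}" \
    --member="serviceAccount:${service_account}" \
    --role=roles/pubsub.subscriber
}
```

Calling create_swarm_feed "${GCP_PROJECT}" "${SWARM_SA}" (where SWARM_SA is a variable you define yourself) prints the three commands for review; setting DRY_RUN=0 first would execute them.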

Create a new cloud run service
The image will be something like us.gcr.io/udmi-swarm-example/validator (with your own project in place of udmi-swarm-example)
Set the maximum number of instances to 1
Set the memory allocation to 1GiB
Set Authentication: Require authentication
Select “Second generation execution environment”
Note the service URL, something like https://validator-t22jzfa4pq-de.a.run.app
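The console settings above map fairly directly onto gcloud run deploy flags. The following is a sketch under assumptions: the service name validator is inferred from the example URL, the region is not stated on this page and must be chosen by you, and the run and deploy_validator_service helpers are hypothetical. By default the command is only printed.

```shell
# Print each command instead of executing it unless DRY_RUN=0 is set.
run() {
  if [ "${DRY_RUN:-1}" = "1" ]; then
    echo "+ $*"
  else
    "$@"
  fi
}

# Hypothetical helper. Arguments: project id, Cloud Run region (your choice).
deploy_validator_service() {
  project="$1"
  region="$2"
  run gcloud run deploy validator \
    --project="${project}" \
    --region="${region}" \
    --image="us.gcr.io/${project}/validator" \
    --max-instances=1 \
    --memory=1Gi \
    --no-allow-unauthenticated \
    --execution-environment=gen2
}
```

Here --no-allow-unauthenticated stands in for “Require authentication” and --execution-environment=gen2 for the second generation execution environment. When run for real, gcloud reports the service URL that the next step needs.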

Create a new job
Name swarm
Frequency of every 15 minutes with */15 * * * *
Target type “HTTP”
URL is something like https://validator-t22jzfa4pq-de.a.run.app/?site=zz-top-example&topic=swarm-feed
- The first part is taken from the Cloud Run service URL
- The site (zz-top-example) is the site_model from before
- Topic should point at the created PubSub topic.
HTTP method “GET”
Auth header “Add OIDC token”
Default Compute Engine Service account
Audience is the Cloud Run service URL
Go to IAM and add Cloud Run Invoker to roles for the Default Compute Engine service account
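The job URL is just the Cloud Run service URL with site and topic query parameters appended, so it can be assembled mechanically. The sketch below builds that URL and prints a matching gcloud scheduler command. The helper names are hypothetical, and the scheduler location and service account e-mail are assumptions you must supply yourself.

```shell
# Print each command instead of executing it unless DRY_RUN=0 is set.
run() {
  if [ "${DRY_RUN:-1}" = "1" ]; then
    echo "+ $*"
  else
    "$@"
  fi
}

# Build the job target from the service URL, site_model name and topic.
swarm_job_url() {
  service_url="${1%/}"   # drop one trailing slash, if present
  printf '%s/?site=%s&topic=%s\n' "${service_url}" "$2" "$3"
}

# Hypothetical helper. Arguments: project, scheduler location, Cloud Run
# service URL, site name, service account e-mail.
create_swarm_job() {
  project="$1"; location="$2"; url="$3"; site="$4"; service_account="$5"
  run gcloud scheduler jobs create http swarm \
    --project="${project}" \
    --location="${location}" \
    --schedule="*/15 * * * *" \
    --uri="$(swarm_job_url "${url}" "${site}" swarm-feed)" \
    --http-method=GET \
    --oidc-service-account-email="${service_account}" \
    --oidc-token-audience="${url}"
}
```

For example, swarm_job_url https://validator-t22jzfa4pq-de.a.run.app zz-top-example swarm-feed prints the URL shown in the list above. The Cloud Run Invoker role still has to be granted separately, as described in the last step.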

Check the Cloud Run logs to make sure it’s getting triggered
Check the PubSub topic to see that it’s receiving info from Cloud Run
Check the PubSub subscription to see if the swarm pubber instances are pulling from it
Check the k8s pod logs to see if they’re getting targets and connecting properly
Check IoT Core to see if it’s receiving data
Check the registry topic (e.g. data-target) to see that it’s receiving data
Run the validator to check received data: udmi$ bin/validator sites/zz-top-example ${GCP_PROJECT} ${DATA_SUBSCRIPTION}

Control the number of simulated devices by scaling the number of replicas and pubber instances; the total equals replicas * instances.
Use kubectl scale deploy pubber-pool --replicas=10, or edit the workload configuration file.
Edit the PUBBER_INSTANCES env variable in the deployment config.
The total is also limited by the number of devices in the site_model.
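The sizing rule above can be worked through with a small calculation. This sketch assumes that “limited by” means the effective count is capped at the number of devices in the site_model; the function name swarm_device_count is hypothetical.

```shell
# Effective number of simulated devices: replicas * instances, capped at the
# number of devices defined in the site_model (assumed interpretation).
swarm_device_count() {
  replicas="$1"
  instances="$2"
  model_devices="$3"
  total=$(( replicas * instances ))
  if [ "${total}" -gt "${model_devices}" ]; then
    total="${model_devices}"
  fi
  echo "${total}"
}
```

For instance, swarm_device_count 10 5 100 gives 50, while swarm_device_count 10 5 30 gives 30 because the site_model only defines 30 devices.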
