7 min read

What is vendor lock-in (and how to break free)?

By CAST AI on Jan 14, 2021 5:42:40 AM

Vendor lock-in is one of the greatest constraints to your flexibility if you decide to rely on a single cloud provider.

Topics: multi cloud
7 min read

The no-nonsense guide to IaC and Terraform

By Augustinas Stirbis on Jan 12, 2021 11:57:11 AM

One senior DevOps guy that I recently spoke to said he would burst into tears of despair if he got a request like one in the meme above. Don’t worry, we’ve got you covered.

Enter Terraform

Sometimes products are so popular that the whole market becomes synonymous with the product's name - walkman, photoshopped, velcro, to google, to terraform are just a few examples.

Terraform has for quite some time been de facto synonymous with Infrastructure as Code (IaC). It is an open-source tool released in 2014 that basically does one thing and does it extremely well. Terraform makes sure that IT infrastructure (real world) is consistent with the desired configuration. 

In other words, Terraform is a state machine that can manage resources backed by REST API.

This won't be a HOW TO tutorial piece that duplicates the role of documentation. I will instead share my thoughts on the WHY of Terraform.

I.LOVE.TERRAFORM. I have used it since 2016. We used Terraform to validate our product’s proof of concept at CAST AI.

There is a market challenger called Pulumi, but I don’t think anything really threatens Terraform’s domination at this point.

If you are an engineer and you haven't heard of Terraform yet, your resume needs urgent CPR (find an hour to complete Hashicorp’s excellent tutorial).

6 min read

Kubernetes cost estimation: 4 problems and how to solve them

By CAST AI on Jan 8, 2021 6:57:21 AM

If you work with Kubernetes, you know this pain: estimating the cost of a specific Kubernetes workload is really hard. We talked about why controlling cloud costs is so difficult here.

But there are a few things you can do to ease this pain and arrive at a more accurate estimation of your workload costs.

Let’s go over the strategies to help you control the costs and make the most of your resources.

Topics: Kubernetes
4 min read

Kubernetes log processing: 4 challenges and how to solve them

By CAST AI on Jan 7, 2021 6:55:58 AM

Application logs are a great help in understanding what's happening inside the application. They come in handy for debugging and monitoring cluster activity. But what are the most common challenges in Kubernetes log processing, and how can developers deal with them? 

Topics: multi cloud
3 min read

Google Anthos vs CAST AI Comparison: Who Wins?

By Leon Kuperman on Jan 6, 2021 11:02:41 AM

The idea that you can build an app in one place and then deploy it across multiple clouds is an enticing one. That’s why multi-cloud solutions are on the rise today. They promise to help developers in expanding their cloud portfolios and battle for the industry’s attention.

Topics: Insider anthos multi cloud
1 min read

AWS down again: Slack, Zoom, Notion likely affected. Why sink with them?

By CAST AI on Jan 5, 2021 12:43:50 PM

Amazon Web Services (AWS) had a second outage in two months on the 4th of January. Slack, Zoom and Notion were likely among the affected businesses.
On the 25th of November, AWS had another major outage. Widely utilized services like Amazon API Gateway, Lambda, DynamoDB, CloudWatch, CloudTrail, CloudFormation, Autoscaling, Athena and others were affected.

"So many services being impacted simultaneously may be indicative of a large software rollout, or potentially localized underlying hardware or network issues,” said Leon Kuperman, CTO and Co-Founder at CAST AI. “If it’s a software issue, it’s unusual that AWS would have chosen US-East as a canary region. US-East (Virginia) represents the largest AWS region and largest customer footprint, which would put it near last, if not last on the rollout schedule." 

Topics: news
2 min read

Can Google unplug the internet?

By CAST AI on Dec 14, 2020 7:25:57 AM

Update: Google published some more details about the incident in the Google Cloud Status Dashboard.

As it turned out, the Google User ID Service that handles authentication credentials for OAuth tokens and cookies detected outdated data. Google was migrating the Service to a new quota system.

While they applied a change in October to register the User ID Service with the new quota system, some parts of the previous quota system were left in place. This resulted in incorrect reports on the usage for the User ID Service as 0. An existing grace period on enforcing these quota restrictions delayed the impact, but it eventually expired and triggered the automated quota systems to decrease the quota allowed for the User ID service.

They just did. Google followed in AWS footsteps and let a portion of their key services to go down. Google products – YouTube, Google Image Search, Google Analytics and the whole Google Workspace package (Gmail, Calendar, Drive, Sheets, Docs, Slides and others) were inaccessible globally.

We identified the main culprit – Google OAuth service, which is used to access Google products and access to many major apps like Asana, HubSpot, Discord, Slack, Figma, Monday.com and many others. If your organization is reliant on one of these products, you were left barehanded without the tools needed to do your work. CAST AI uses several independent services to log in to our platform, including Google OAuth and others, precisely to eliminate the risk of one sign-in tool being unavailable.


Engineers usually joke that if Google goes down, the internet goes down. That’s how tightly knit the two are. But that proves the cost of over-relying on cloud native applications and services from just a single cloud provider. For startups and enterprises alike, today’s reliance on Google will cost many millions of dollars. The cloud leviathan will survive, the companies will have to recoup on their own.

This time it was OAuth. Next time it could be Compute, Network, Storage, and other Infrastructure as a Service (IaaS) related services. Complete dependence on one cloud provider puts your business in danger of being taken down together with Google, AWS or Azure.

“A cloud failure should not become a disaster. Never again.” said Laurent Gil, CPO and Co-Founder at CAST AI. “You can rest easy knowing that your infrastructure is never down.” Companies have to take back control and have a multi-cloud disaster recovery and fail-over strategy. This way, Asana, HubSpot, Discord, Slack, Figma, and Monday.com at least can know that their users are left unscathed.

CAST AI empowers companies to run applications on multi cloud, and automatically shifts resources to Azure or AWS when Google Cloud has troubles. Don't let your cloud consumption be affected by a single cloud company. 

Topics: news
4 min read

Kubernetes deprecating Docker - what’s the deal with that?

By Rokas Bilevičius on Dec 10, 2020 4:52:24 AM

Everything is fine. Docker images will still work on future Kubernetes versions. Read this article if you’d like to know why this won’t actually affect you at all.

Kubernetes recently announced that it’s deprecating Docker as a container runtime after v1.20. But this announcement is more of a viral headline than a real revolution.

Topics: news
1 min read

When AWS sinks, why sink with it?

By CAST AI on Nov 25, 2020 11:55:19 AM

Amazon Web Services (AWS) is having a major outage today. Widely utilized services like Amazon API Gateway, Lambda, DynamoDB, CloudWatch, CloudTrail, CloudFormation, Autoscaling, Athena and others were affected.

Topics: news
11 min read

Cloud me a river

By Leon Kuperman on Nov 24, 2020 1:02:50 PM

“Multi-cloud is the worst practice” – is it, really?

Post written by Leon Kuperman, CTO at CAST AI.