The unSkript Blog

Categories

unsafe path

Security Checkup: Force AWS Load Balancers to redirect to HTTPS

When on a hike – you always want to take the most secure path, and not risk taking an unsafe route.  With so many features now in the Cloud, it can be tricky to ensure that all of the traffic in and out of your services is secured over HTTPS – both for the safety…

Copy AMI Images across regions

Copying AWS Amazon Machine Images Across Regions with unSkript

When you are building a distributed platform, You’ll need to regularly update the machines that you have deployed around the world.  With Amazon Web Services (AWS), one way to do this is to deploy the updated machine in one region, and create an Amazon Machine Image (AMI) of that server.  Then, by copying that AMI…

Stethascope and mask

Building a Healthcheck RunBook: Analysis of K8s Logs

There’s nothing worse than an outage. Every outage has a cost – both in revenue, sales, but also reputation and perceived reliability.  No one wants their company to appear in the news as having a DevOps issue that cratered their service.   Building RunBooks to perform regular health checks on your system is an effective way…

modular construction: Mostly uniform, but easy to customize to the customer's exact needs.

Extending unSkript Actions: Quick Customizations of ‘Off the Shelf’ Code

unSkript has hundreds of built-in Actions that allow you to quickly build integrated Runbooks with popular Cloud providers, databases, and more.  But sometimes, the supplied Actions are not exactly what you are looking for.  Luckily, since each Action is written in Python, it is easy to extend and customize any existing Action to exactly suit…

Screenshot 2022-12-20 at 12.10.27

Will AI replace us? Using ChatGPT to create Python Actions for unSkript

Over the last few weeks, ChatGPT has been the buzz of the development world.  One of the biggest questions that has been asked is “Is the AI coming to replace developers?”  Will a fleet of Daleks come after us shouting “exterminate?” Or will AI code generators just become another tool in our arsenal? In this…

Image of paper tags

Managing your Cloud Costs with CloudOps Automation Part 1: Identifying Your Resources with Tags

Moving systems to the cloud makes a lot of sense operationally – letting the experts take care of the infrastructure, and let us build what we need to make our company successful. But this comes at a substantial downside – your monthly cloud bill.  Cloud providers have made it insanely easy to spin up new…

A stereo ornament hanging on a Christmas Tree.

Cloud Ops Auto Remediation: A Holiday Allegory

Auto remediations are tools that respond to events with automations able to fix, or remediate, the underlying condition. Building a demo that features auto remediation fix is hard, because generally modern infrastructure is resilient so keeping it in an error state is difficult. In order to feature an auto-remediation example, we’re going to get a little creative.…
mountains

Automate the triviality in AWS using unSkript

Introduction This week, AWS is celebrating all that is AWS at their re:Invent conference in Las Vegas. Here at unSkript, we are celebrating all that is AWS by highlighting how to automate common AWS tasks with our xRunBooks. In our first series (Part 1, Part 2, Part 3) , we built a xRunBook that creates an IAM…
policies

Automating the AWS Identity Lifecycle with unSkript (Part 3)

This week, AWS is celebrating all that is AWS at their re:Invent conference in Las Vegas. Here at unSkript, we are celebrating all that is AWS by highlighting how to automate common AWS tasks with our xRunBooks. In our first two posts in this series (Part 1 and Part 2) , we built a xRunBook that creates…
attachpolicy

Automating the AWS Identity Lifecycle with unSkript (Part 2)

This week, AWS is celebrating all that is AWS at their RE:Invent conference in Las Vegas. Here at unSkript, we are celebrating all that is AWS by highlighting how to automate common AWS tasks with our xRunBooks. In our last post, we began the process of building an Identity Management lifecycle workflow — by using the open…
xrunbooks

Automate the AWS Identity Lifecycle with unSkript (Part 1)

This week we are celebrating AWS re:Invent by highlighting common AWS tasks that are ripe for automation. In our first three posts for re:Invent week, we’ll build and extend some basic identity management xRunBooks to automate a simple identity lifecycle management process. unSkript offers a framework for your CloudOps automations built on top of Jupyter…
elastic

How to perform an Elasticsearch Rolling Restart

Database-driven search engines often take a long time to respond to queries. This is where search engines like Elasticsearch come into play by storing, retrieving, and managing data using a NoSQL database. Even though the primary purpose of Elasticsearch is to make the data available at all times, there may come a time when one…
dns

DNS: Is your domain healthy?

DNS: Is your domain healthy? When your website goes down, you risk losing customers, revenue, and credibility. When there is an issue — it is critical to know as quickly as possible in order to resolve the problem. In this post, we’ll build an automated Runbook to check the DNS of our website at regular…
Screenshot 2022-12-04 at 10.09.08

Part 3 – Proactive Diagnostics and Remediation for Platform Reliability and Incident Response

In our first post we answered how you can start or improve your incident response using runbook automation in your organization.   The second post gives you a framework for evaluating processes and technologies for the phases we see that leading organizations are building in the space.    In this third post, we discuss how…

secopsfeaturedimage

SecOps Automation: Reminder for SSL Certificate Renewal

These days, there is no excuse not to enable HTTPS for your website. In fact, it is mandated by Google in early 2018. As per the report given by major certificate issuing authorities, >85% of websites are using SSL certificates. However, certificates aren’t perpetual, meaning you’ll eventually need to renew them and do so correctly. SSL…
fire_buckets

Part 2 — Best Practices for Modern Incident Response: Why Monitoring, Alerting and Remediation are Not Enough

This second post in a 3-part series specifically looks into the ways leading organizations handle incident response today. In our first post we answered how you can start or improve your incident response using RunBook automation in your organization. The second post gives you a framework for evaluating processes and technologies for the three phases we see…
part1

Part 1 — What is Incident Response, how does it work, and what do the best do?

This is the first in a three-part series designed to help you start or improve your incident response and runbook automation in your organization. Goals are to give you an unbiased framework for evaluating where you are, what leaders in the space are doing, and where things are going. This post specifically looks into the…
rotatingkeys

Rotating AWS Access Keys for Enhanced Security

Cloud adoption is going on everywhere. One of the first things users learn when onboarding into their shiny new AWS cloud account is access controls! To consume cloud with AWS cli, access keys based authentication is a very popular method. We can think of the secret key as a username and the Access key ID…
healthcheck

Have you ever: Built a Service Health Check

Most of the tech giants including companies like Amazon, Netflix, started to build their systems using a monolithic architecturebecause back in the time it was much faster to set up a monolith and get the business moving. But over time as the product matures or fast growth happens, with growing systems the code gets more…

photo-1432888498266-38ffec3eaf0a

Three Critical Capabilities for Intelligent Automation of Incident Response

In my last blog, we looked at the various challenges impacting incident response and why its mostly a manual process. It’s time to look at what it takes to introduce automation into incident response. We see this as a three pronged approach: platform, content & context and customizability. Let me explain. Open and Extensible Platform In…

whymanualprocessfeatures

Why is incident response such a manual process?

Tell tale signs that you need to take a close look at your MTTR In today’s agile world, service reliability can be directly correlated with customer satisfaction and successful business outcomes. However, manual incident response can affect your service reliability for the following five reasons. Let’s walk through them one at a time.   Too…

Domino Effects of the Facebook Outage

The need for Responsible Automation of Production Incidents Unless you are living under a rock, you saw the Facebook outage play out earlier this week. While not being able to share and comment on cat pictures was the common outcome for most customers, the story was much bigger in the operations community. Facebook has already…