Using Metrics to Improve the Development Process (and Coach People)

Posted on October 19, 2018 by Jack Humphrey

In the previous post, I described Imhotep, our scalable, efficient, and fast open source data analytics platform. Imhotep helps us use metrics at Indeed for fast, iterative experimentation, which drives improvement to our products.

Improving process and coaching people

We use the same tools and techniques to improve development processes, following a measure-question-learn-improve cycle:

Measure everything we possibly can.
Learn by asking questions and exploring the data we’ve collected.
Use our learnings to try to improve.
Measure continuously to confirm improvement.

Beyond process improvements, this approach can also work for people. Data can help us understand our own work and coach others.

How much am I getting done?
How am I engaging with other teams?
How has my work changed over time?
What are my blind spots?

Is measuring processes and people a good idea?

You might be skeptical of using this approach for improving process and measuring people. It’s good to be skeptical. To truly benefit from this approach, you must proceed with caution.

Gaming the stats (Goodhart’s Law)

The first caution is Goodhart’s Law, which states that “When a measure becomes a target, it ceases to be a good measure.” For example, your manager might say: “Our measurements show that our team productivity is declining. Let’s set a target to increase features completed by 20% next quarter. If we hit it, big bonuses all around!”

Okay, but your team might then start “gaming the stats” — making changes that improve the metric without improving productivity. Now your measure is meaningless for gauging productivity, and you’ve rewarded your team for counterproductive measures that don’t advance your goals.

The Number Six Principle

The second caution is something I’ve named the Number Six Principle (inspired by a classic TV character and his famous line): Don’t reduce people to a set of numbers.

Image illustrating the Number Six Principle with header text "I am not a number" and a silhouette of character Number Six from TV show "The Prisoner."

No one enjoys being judged entirely by numbers. If you tell people you’re measuring them, you run the risk of seriously damaging morale. Many people will assume you’re not considering qualitative performance elements.

It’s how you use them

You can avoid these pitfalls if you’re careful. Consider the example above in which your team is concerned about slipping productivity metrics. If you take a close look at the numbers, understand them in context, and diagnose the situation, you can have a productive dialog about how to improve.

Perhaps your team tackled more complex features, therefore completing fewer. That might be okay, or you might agree as a team that you could have done a better job of simplifying the feature work.

Or maybe you look at a different metric and see that your overall support load went up 50% due to growth in your customer base. You can then decide to live with that balance or try to augment your team’s capacity to handle support while developing new features.

Starting with the measurements, a considered discussion can lead to tangible process improvement. In the next post, I describe a process improvement we validated and measured with Imhotep.

Read the full series of blog posts:

Imhotep: Scalable, Efficient, and Fast
Using Metrics to Improve the Development Process (and Coach People)
Metrics-Driven Process Improvement: A Case Study
What’s Up, ASF? Using Imhotep to Understand Project Activity
The Benefits of Hindsight: A Metrics-Driven Approach to Coaching

Using Metrics to Improve the Development Process (and Coach People) cross-posted on Medium.

Imhotep: Scalable, Efficient, and Fast

Posted on October 18, 2018 by Jack Humphrey

This post is the first in a five-part series on improving the development process (and coaching developers) with metrics-driven insights.

Move fast and try things — that’s how we develop products at Indeed. We don’t believe in betting on a small number of great ideas. Instead, we bet on exploring lots of ideas as quickly as possible.

To be successful in this approach, we need innovative team members with diverse perspectives. We hire people who are excited to quickly explore ideas in service of our mission — to help people get jobs. Once they’re on board, we give them ownership and autonomy to do exactly that. And we give them the tools to track and analyze their experiments.

The right tools for the job

We’ve developed and open sourced some of these tools, including Imhotep, our data analytics platform. Imhotep enables rapid exploration and analysis of large time-series datasets. It includes a query language (IQL), a web-based UI, and a distributed backend. It is scalable, efficient, and fast.

Graphic displays an Imhotep motto: measure. question. learn. improve.

Imhotep is scalable

Imhotep scales horizontally by adding daemon instances that can run on commodity hardware or in the cloud. Indeed’s internal Imhotep cluster handles up to 5 million queries each week across thousands of datasets. Roughly 90% of those queries come from automated systems.

Our most popular dataset includes about 39 billion events just for the last year. That dataset alone receives around 25,000 distinct queries each month.

Imhotep is efficient

Because the data structure underlying Imhotep is an inverted index, the disk utilization is remarkably low for most time-series datasets. The dataset mentioned above, with 39 billion events and 384 possible fields per event, takes up 5.7 terabytes on disk. That works out to 146 bytes per event.

That kind of storage efficiency allows us to keep all the data for analysis and avoid sampling. Sampling is fine when you want to just look at aggregate trends. But if you want to actually dig down into your data and examine the outliers, you can’t reliably find them or see their effects if you sample.

Imhotep is fast

Imhotep’s speed lets us rapidly iterate and collaborate. Over a recent 90-day period at Indeed, our internal cluster saw around 2 million interactive Imhotep queries (queries done from the webapp). The median response time for those queries was 276 milliseconds.

A powerful cache implementation contributes to this blazing speed, with nearly 60% of interactive queries coming from the cache. But even uncached queries are quite fast, with a median response time of around 4 seconds. An uncached query over a long time span takes longer, but not that much longer. For uncached queries with a 365-day time span, the median response time is about 9 seconds.

How do we know all these stats about Imhotep performance? Because we have an Imhotep dataset for Imhotep usage. In just a few minutes, I was able to iteratively query that dataset to understand recent cluster performance.

Imhotep drives insight and improvement

Imhotep empowers us to experiment and quickly improve our products. We’ve also applied this data-driven approach to improving development processes. In the next post in this series, I explain more about how we use metrics to improve process.

Read the full series of blog posts:

Imhotep: Scalable, Efficient, and Fast
Using Metrics to Improve the Development Process (and Coach People)
Metrics-Driven Process Improvement: A Case Study
What’s Up, ASF? Using Imhotep to Understand Project Activity
The Benefits of Hindsight: A Metrics-Driven Approach to Coaching

Imhotep: Scalable, Efficient, and Fast cross-posted on Medium.

Audit Your Web Applications with AVA

Posted on September 13, 2018 by Jonathan Hawes

Hosting a web application is an excellent way to expose useful services to the public, but it comes at a cost: vulnerabilities in your web apps could allow attackers to access important systems, endangering your customers and your business.

The Indeed Security team developed Another Vulnerability Auditor (AVA) to address this problem. By using AVA to automate application scans, we can continuously monitor production and QA systems for potential vulnerabilities. And, because we have released it as an open source tool, you can use it to monitor your applications as well.

How does it work?

AVA scans a set of application endpoints, defined in HTTP Archive (HAR) format. The HAR file catalogs the URL, headers, cookies, and POST data for HTTP requests. AVA uses this information to model endpoints and scan them using a combination of auditors and checks.

Auditors

Auditors determine the HTTP elements that AVA audits. These include URLs, headers, cookies, and POST data.

Type	What does it audit?
Cookie	Individual cookies in the Cookie request header
Header	Most request headers
JSON	JSON data in the request body
Multipart	Multipart form data in the request body
Parameter	Parameters in the URL query string and request body
Response	Aspects of a response (passive audit)
Text	Plain text data in the request body
URL	Request URL

Checks

Checks determine the types of security vulnerabilities AVA checks. These include cross-site scripting, open redirects, SQL injection, and shell injection.

Type	What does it check for?
Code injection	Code injection in Python’s pickle and eval statements
Header injection	Header injection in responses
Open redirect	Open redirects to arbitrary URLs
Path traversal	Path traversal on the local filesystem
Shell injection	Shell injection in Bash statements
SQL injection	SQL injection in database queries
Cross-site scripting	HTML and JavaScript injection in responses
XML external entity	XML external entities in XML documents
Personally Identifiable Information (PII)	Email addresses, credit cards, and Social Security numbers

How can I use it?

We designed AVA for use within automated systems. We automate AVA with Docker Swarm and Jenkins. However, you can use AVA anywhere Python can be installed.

Use in Docker Swarm

Indeed’s Security team uses Docker Swarm to automate AVA and scan public-facing applications daily. This allows us to identify vulnerabilities shortly after they are introduced. The pipeline has three components:

Enricher combines data from sources, such as WES, into endpoint definitions
Scheduler maintains a schedule and configuration
Vulnerability manager stores reports and displays vulnerability information

The process is as follows:

The scheduler contacts the enricher and requests endpoint definitions for the current application.
The enricher returns these definitions in HAR format.
The scheduler pushes the HAR data and configuration settings to AVA.
AVA runs the configured scan against the application and generates a report.
AVA sends the report to the vulnerability manager for storage.

Use in Jenkins

We also use AVA in Jenkins to examine systems in our QA environment. This allows us to identify vulnerabilities before they reach production. The pipeline has two components: functional tests and AVA. The functional tests are a collection of Selenium-based test cases for verifying release candidates in QA.