This post is the first in a five-part series on improving the development process (and coaching developers) with metrics-driven insights. Move fast and try things — that’s how we develop products at Indeed. We don’t believe in betting on a small number of great ideas. Instead, we bet on exploring lots of ideas as quickly […]
All posts categorized in: Big Data
Finding Anomalies in User Behavior with Python
In the course of helping over 200 million unique visitors every month find jobs, we end up with a lot of data. The data we collect can tell us a lot about the behavior of our users, and for the most part we observe predictable patterns in that behavior. But unexpected changes could be evidence […]
Vectorized VByte Decoding: High Performance Vector Instructions
Data-driven organizations like Indeed need great tools. We built Imhotep, our interactive data analytics platform (released last year), to manage the parallel execution of queries. To balance memory efficiency and performance in Imhotep, we developed a technique called vectorized variable-byte (VByte) decoding. VByte with differential decoding Many applications use VByte and differential encoding to compress […]