What’s with all the Ops?

DevOps, DataOps, MLOps – the number of different “Ops” combinations seems to have exploded over the last year or so. There are manifestos, meetups, lots of blog posts and research articles about these various approaches. In order to get clear on terminology, I think it’s good to define what we’re talking about. So, first, DevOps … Read moreWhat’s with all the Ops?

AI engineering part 2: data versioning and dependency management

In my last column, I presented our research agenda for AI engineering. This time, we’re going to focus on one of the topics on that agenda, ie data versioning and dependency management. Even though the big data era has been with us for over a decade now, many of the companies that we work with … Read moreAI engineering part 2: data versioning and dependency management

How to generate data for machine learning

In recent columns, I’ve been sharing my view on the quality of the data that many companies have in their data warehouses, lakes or swamps. In my experience, most of the data that companies have stored so carefully is useless and will never generate any value for the company. The data that actually is potentially … Read moreHow to generate data for machine learning

AI is NOT big data analytics

During the big data era, one of the key tenets of successfully realizing your big data strategy was to create a central data warehouse or data lake where all data was stored. The data analysts could then run their analyses to their hearts’ content and find relevant correlations, outliers, predictive patterns and the like. In … Read moreAI is NOT big data analytics

Why your data is useless

Virtually all organizations I work with have terabytes or even petabytes of data stored in different databases and file systems. However, there’s a very interesting pattern I’ve started to recognize during recent months. On the one hand, the data that gets generated is almost always intended for human interpretation. Consequently, there are lots of alphanumeric … Read moreWhy your data is useless

When you don’t know, run experiments

The effects of digitalization and other technological shifts cause companies to realize they need to change. This often leads to significant discussions in the organization as there typically are several alternatives being considered by different people. These might include topics such as business models, product implications, partnerships with suppliers and technology providers. Agreeing on the … Read moreWhen you don’t know, run experiments

Why you’re a product + service business

Before digitalization became a thing, the industry was divided into two types of businesses: those that sell products and those that provide services. Those that sell products have a transactional relationship with their customers, mostly consisting of selling one of their products. The service businesses tend to have a more continuous relationship with their customers … Read moreWhy you’re a product + service business

Get your data out of the gray zone

Last week, I wrote about the different types of use for data that we have available. That led to discussions with various people and I realized that there’s a problem around data that’s very typical in companies that have their roots in embedded systems or mechanical engineering: it’s actually unclear who owns the data from … Read moreGet your data out of the gray zone