In this presentation, we describe how we developed modules and pipelines for the industrial processing of data and content from a machine learning project in the legal domain. Lawyers' work is far from being digitized. They have to deal with large amounts of printed documents of various types (e.g. letters, contracts, invoices, orders, offers, court documents). A single mandate can consist of several hundred paper folders. To work on specific aspects of a case, lawyers need to review, annotate, and reorder their documents, and sometimes even reprint them and attach them to a new folder. On top of that, legal disputes can take several years to settle, so it is not easy to keep track of all the relevant details. One of the main pain points for lawyers is grasping the connections and processes in a mandate in a reasonable amount of time. This setting became the starting point for our core use case and our user journey: with CaseWorx, we help lawyers organize and structure the documents of a construction law mandate by applying machine learning and semantics combined with domain knowledge.