How to Write Data Transformations With Clean Code

How to Write Data Transformations With Clean Code

Sergii Volodko
Sergii Volodko

February 10, 2022


Data scientists are not primarily concerned with software, but poor code quality can quickly place limits on what their models can do. As they repeatedly ingest data, train their models, and analyze the results, they need reliable, reusable, and flexible code. Quality software can empower them with more time to focus on their main areas of expertise.

Reliable, reusable, and flexible — these descriptors are familiar to craft software professionals, and practices like domain-driven design (DDD) and test-driven development (TDD) can empower data scientists to innovate with confidence. The engineers who productionize the models can similarly benefit from a stable codebase that easily integrates with another software environment.

Sergii Volodko has spent more than a decade seeing these tensions and opportunities firsthand, working as both a data engineer for four years and a software engineer ever since. As a principal crafter at 8th Light, he has helped data science and development teams create overlapping practices and processes that produce more dependable solutions. In this presentation, he reflects on his experience and shares some key opportunities for software developers to help their colleagues embrace a new way of crafting their code.

Join the Dialogue

About 8th Light University

8th Light University (8LU) is a virtual event series curated by 8th Light, a global software consultancy that partners with clients in creating software solutions. Topics focus on improving the craft of software product, design, and development. Software professionals of all skill levels are welcome! Watch past event videos on YouTube, join us at our next 8LU event, or sign up for our newsletter to to find out about upcoming events.