From the course: Introduction to Career Skills in Data Analytics

Unlock this course with a free trial

Join today to access over 25,100 courses taught by industry experts.

Common cleaning and transformation

Common cleaning and transformation

- When building your cleaning and transformation toolbox, there's some common cleaning and transformation items you will use. Others will be more specific to the deeds of the data you work with. Let's start with general cleaning. Spaces are invisible to the eye, but in fact, they're characters. And when a field has extra spaces, you will want to clean those by removing them. There are leading spaces which are spaces that are at the front of the field. There are trailing spaces which are at the end of the field. When we want to remove either leading or trailing spaces, then we can use functions like trim or clean. The act of breaking out text is referred to as parsing text. And we can do this with any type of delimiter and every program handles this a little bit differently, but the outcome is the same. Spaces will also serve as a delimiter, like the spaces between words are valid spaces. Imagine first name and last name.���

Contents