From the course: CompTIA SecAI+ (CY0-001) Cert Prep

Unlock this course with a free trial

Join today to access over 25,600 courses taught by industry experts.

Data collection

Data collection

High quality data is the foundation of every AI system. The quality, diversity, and trustworthiness of that data determines how reliable the model will be once it's deployed. During the data collection stage, teams gather information from approved and verifiable sources. Each data set should be reviewed for accuracy, completeness, and authenticity. It should also be checked for representativeness, provenance, bias, and fairness. Using cryptographic hashes or digital signatures can help confirm that data hasn't been tampered with during transfer or storage. Legal and ethical considerations are equally important during this process. When personal or sensitive information is involved, consent must be explicit and documented. Privacy laws such as GDPR or HIPAA require that individuals understand how their data will be used and that it's collected only for legitimate purposes. Here's a quick exam tip for you. Be ready to assess the security risks of different data sources. You may be…

Contents