- Omkar Jois (PES1UG21CS397)
- Ananya Jha (PES1UG21CS077)
- Kunjam Nanavaty (PES1UG21CS294)
- Bhoomi Bhat (PES1UG21CS138)
Object detection is a crucial task in computer vision with applications ranging from autonomous driving to surveillance systems. However, the performance of object detection models heavily relies on the quality and diversity of the training data. Data augmentation techniques have been widely adopted to address the challenges posed by limited and imbalanced datasets. In this paper, we propose leveraging generative AI methods for data augmentation in object detection tasks. We explore various generative models and augmentation strategies to enhance the performance and robustness of object detection systems.
The Vehicle dataset is a dataset with 810 annotated images, meticulously compiled to serve as a proof of concept for vehicle detection through object recognition systems. Initially curated to challenge the capabilities of contemporary object detection algorithms, this dataset is specifically designed to include a diverse array of vehicles in various environmental settings, angles, and lighting conditions.

The Bottle dataset is a curated collection comprising 150 images of bottles, forming the cornerstone of a proof of concept aimed at enhancing object detection capabilities. This bottle dataset is characterized by its variety, containing images of bottles of different shapes, sizes, colors, and materials, captured in assorted settings and lighting conditions to mimic real-world complexity.

- The bottles folder contains the train, test, and val folders to train the YOLOv8 model for detecting bottles. This is the dataset with no augmentation.
- The train.py file is used to train the YOLO model. To run it just run
python3 train.py - The test.py is used to test the model on the test1 and test2 videos. To test the video srun
python3 test.py. It will create an out.mp4 folder with the result. - To change the test videos for testing, open the test.py file and change the path in the
VideoCapture()function to your desired path.
- The Vehicles_new folder contains the train, test, and val folders to train the YOLOv8 model for detecting vehicles. This is the dataset with no augmentation.
- The train.py file is used to train the YOLO model. To run it just run
python3 train.py - The test.py is used to test the model on the test1 and test2 videos. To test the video srun
python3 test.py. It will create an out.mp4 folder with the result. - To change the test videos for testing, open the test.py file and change the path in the
VideoCapture()function to your desired path.
- The Vehicles_normal folder contains the train, test, and val folders to train the YOLOv8 model for detecting vehicles. This is the dataset with augmentation.
- The train.py file is used to train the YOLO model. To run it just run
python3 train.py - The test.py is used to test the model on the test1 and test2 videos. To test the video srun
python3 test.py. It will create an out.mp4 folder with the result. - To change the test videos for testing, open the test.py file and change the path in the
VideoCapture()function to your desired path.
- This folder has the codes to create the augmented data with Gen AI.
- to run the detection model run
python3 main.py. Change the prompts list and the number of duplicates to choose the number of variants and type of variants you want. The paths of teh source and destination of the labels can also be changed according to your needs.


