AI models often need diverse datasets to perform well in real-world scenarios. Video streams, commonly captured at 30-60 frames per second, can contain long runs of identical or near-identical frames, which makes it hard to curate the right data efficiently. Traditional downsampling methods, such as keeping frames at a fixed interval, may skip valuable information and lead to poorer model performance.
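
To make the downsampling concern concrete, here is a minimal sketch. It assumes that "traditional downsampling" means keeping every Nth frame, and it uses a made-up clip in which each frame is just a text label; the function name `fixed_stride_sample` and the frame contents are hypothetical and only serve as illustration.

```python
def fixed_stride_sample(frames, stride):
    """Keep every `stride`-th frame, starting with the first."""
    return frames[::stride]


# Hypothetical 2-second clip at 30 fps: a mostly static scene with a
# brief event that is visible only in frames 31 through 35.
FPS = 30
frames = ["static"] * (2 * FPS)
for i in range(31, 36):
    frames[i] = "event"

# Keep one frame out of every 30 (about one per second of video).
sampled = fixed_stride_sample(frames, stride=FPS)

print(len(frames), "frames in,", len(sampled), "frames out")
print("event kept:", "event" in sampled)
```

In this toy clip the sampler keeps only frames 0 and 30, both of which show the static scene, so the short event never reaches the dataset even though most of the discarded frames were redundant.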