18
Why does everyone act like you need a huge dataset to train a decent model?
In my experience, using a focused set of just 500 high quality images with a tool like PyTorch and a lot of data augmentation can get you 85% of the way there for a basic classification task, which is plenty for a lot of real world uses.
2 comments
Log in to join the discussion
Log In2 Comments
mary_schmidt20d ago
Totally agree, it's like this weird myth in the air. I built a pretty good tool to sort my own photos (cats vs. dogs, you know) with maybe 400 pictures and a ton of flips and color shifts. It wasn't perfect, but it did the job for my messy personal library. People get hung up on these massive numbers when a small, clean set you really understand can work wonders.
1
blair39620d ago
Yeah but @mary_schmidt, is a cat/dog sorter really that big a deal... it's just organizing photos.
-1