u/ChazariosU

▲ 3 r/computervision+2 crossposts

Hello, I have pictures of cellulite that look like this:

https://preview.redd.it/kbh9lwxyqnzg1.jpg?width=320&format=pjpg&auto=webp&s=7a53cb3bd439335a9bf7281bbb5ebf0e45d4cb74

My task is to create a classification model. There are 4 classes and only 140 pictures for each of them. Before I started trying different architectures, I removed the logo and scale from the pictures. Then I augmented my pictures with:
- horizontal flip
- random sized crop
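
To make the two augmentations above concrete, here is a minimal pure-Python sketch of what they do. It is not my actual pipeline: the function names, the `min_scale` range, the 0.5 flip probability and the nearest-neighbour resize are all assumptions for illustration, and an image is represented simply as a list of pixel rows.

```python
import random


def horizontal_flip(image):
    """Mirror an image (a list of pixel rows) left to right."""
    return [row[::-1] for row in image]


def random_sized_crop(image, out_h, out_w, min_scale=0.5, rng=random):
    """Cut a random window, then resize it to (out_h, out_w).

    min_scale is an assumed lower bound on the window side length,
    expressed as a fraction of the original side length.
    """
    h, w = len(image), len(image[0])
    scale = rng.uniform(min_scale, 1.0)
    crop_h = max(1, int(h * scale))
    crop_w = max(1, int(w * scale))
    # randint is inclusive on both ends, so the window always fits.
    top = rng.randint(0, h - crop_h)
    left = rng.randint(0, w - crop_w)
    # Nearest-neighbour resize of the window to the output size.
    return [
        [image[top + (y * crop_h) // out_h][left + (x * crop_w) // out_w]
         for x in range(out_w)]
        for y in range(out_h)
    ]


def augment(image, out_h, out_w, flip_prob=0.5, rng=random):
    """Apply a random horizontal flip followed by a random sized crop."""
    if rng.random() < flip_prob:
        image = horizontal_flip(image)
    return random_sized_crop(image, out_h, out_w, rng=rng)


if __name__ == "__main__":
    # Tiny 4x6 "image" whose pixel values encode their position.
    toy = [[r * 10 + c for c in range(6)] for r in range(4)]
    for row in augment(toy, 3, 3, rng=random.Random(0)):
        print(row)
```

In a real training loop the same two steps would normally come from an image library rather than hand-written loops; the sketch only shows the geometry.
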
So far I have tried:
- Swin Transformer
- ConvNeXt
- my own convolutional neural network
All of these architectures reach an accuracy below 80 percent, mostly around 60 percent. Does anyone know techniques or architectures that would let me increase the accuracy?
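
For context on those numbers, here is a rough sketch of the arithmetic behind them: the dataset size, the chance level for 4 balanced classes, and how noisy an accuracy estimate is on a small held-out set. The 20 percent test fraction is an assumption for illustration, not my actual split, and the standard error uses the simple binomial approximation.

```python
import math

NUM_CLASSES = 4          # from the post
IMAGES_PER_CLASS = 140   # from the post
TEST_FRACTION = 0.2      # assumption: hypothetical hold-out fraction


def chance_accuracy(num_classes):
    """Accuracy of uniform random guessing on balanced classes."""
    return 1.0 / num_classes


def accuracy_std_error(accuracy, num_test_images):
    """Binomial standard error of an accuracy measured on n test images."""
    return math.sqrt(accuracy * (1.0 - accuracy) / num_test_images)


total_images = NUM_CLASSES * IMAGES_PER_CLASS      # 560
num_test = round(total_images * TEST_FRACTION)     # 112 under the assumed split
std_error = accuracy_std_error(0.60, num_test)

print(f"total images: {total_images}")
print(f"chance accuracy: {chance_accuracy(NUM_CLASSES):.2f}")
print(f"std. error of 60% accuracy on {num_test} test images: {std_error:.3f}")
```

Under these assumptions, differences of a few percentage points between architectures may fall within the noise of a single split.
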

u/ChazariosU — 17 days ago