Immediately after switching pages, it works with CSR. Reload your browser to see how it works.
Another common constraint in vision vs. language is that the long tail is very long in the visual world. There are a number of domains where you have very few examples to learn from (defects are designed to happen infrequently; rare species for identification show up, well, rarely). And pulling from the blog: "But small models ... benefit greatly from the exact type of experiment outlined in this post: strong augmentation with limited data trained across many epochs."
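To make the "strong augmentation with limited data" idea concrete, here's a minimal sketch of generating many distinct training views from one image. This is purely illustrative (random flip + random crop in NumPy, not the blog's exact recipe, and real pipelines would use something like torchvision transforms):

```python
import numpy as np

rng = np.random.default_rng(0)

def augment(img, crop=24):
    """One augmentation pass: random horizontal flip, then a random crop.
    `img` is an HxWxC array. Illustrative only."""
    if rng.random() < 0.5:
        img = img[:, ::-1]  # horizontal flip
    h, w = img.shape[:2]
    top = rng.integers(0, h - crop + 1)
    left = rng.integers(0, w - crop + 1)
    return img[top:top + crop, left:left + crop]

# One scarce example...
img = rng.integers(0, 256, size=(32, 32, 3), dtype=np.uint8)
# ...seen across many epochs becomes many distinct views.
views = [augment(img) for _ in range(4)]
print([v.shape for v in views])
```

The point is that each epoch sees a different view of the same scarce example, which is why many epochs over limited data can still help a small model.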
One thing you could do is add semantic search, so that when a user searches for "red shoes," the index returns images that look like red shoes even if the metadata says nothing about color or item type. To do this, I'd use a model like CLIP. Here's an example of using CLIP and Supabase to do semantic image search: https://blog.roboflow.com/how-to-use-semantic-search-supabas...
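The core retrieval step is just cosine similarity between a text embedding and image embeddings in a shared space. Here's a minimal sketch of that ranking step, assuming you already have CLIP embeddings precomputed (the toy vectors below are hypothetical stand-ins; in practice you'd get them from a CLIP model and store them in something like Supabase/pgvector):

```python
import numpy as np

def cosine_rank(query_emb, image_embs):
    """Rank image embeddings by cosine similarity to a query embedding.
    Returns (indices most-similar-first, sorted similarities)."""
    q = query_emb / np.linalg.norm(query_emb)
    imgs = image_embs / np.linalg.norm(image_embs, axis=1, keepdims=True)
    sims = imgs @ q
    order = np.argsort(-sims)
    return order, sims[order]

# Hypothetical embeddings standing in for CLIP outputs.
image_embs = np.array([
    [0.9, 0.1, 0.0],  # image of red shoes
    [0.1, 0.9, 0.0],  # image of a blue hat
    [0.8, 0.2, 0.1],  # image of red sneakers
])
query = np.array([1.0, 0.0, 0.0])  # embedding of the text "red shoes"

order, sims = cosine_rank(query, image_embs)
print(order.tolist())  # → [0, 2, 1]: both red-footwear images outrank the hat
```

Because text and images live in the same embedding space, the query matches on visual content rather than metadata, which is exactly what makes "red shoes" work without a color tag.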