4271 views|15 replies

6742

Posts

2

Resources
The OP
 

Friends who play with AI, where do you get your training sets? [Copy link]

 

Some image recognition training sets can be obtained from the Internet, but most of the images obtained from the Internet contain watermarks. For example, where can we get such training sets for digital recognition in this [AI Challenge]?

Images can be found online. If it is data from a product sensor, can it only be obtained from the product? If it can only be obtained from the product, it seems that there will be no difference in the data, which may cause the final trained model to be non-robust. What experience do you have? Please share it~

This post is from Embedded System

Latest reply

As for making training sets and test sets, and annotating and scaling images, these are all done by interns.   Details Published on 2024-6-24 16:38

5998

Posts

6

Resources
2
 

The comprehensiveness and representativeness of training data are very important. Image recognition is OK. There are many ways to obtain pictures. If you can't do it, you can take pictures yourself. The abstract data of sensors is more troublesome.

This post is from Embedded System

Comments

The sensor seems to be able to collect data only by itself, but the amount seems to be huge.  Details Published on 2024-6-20 17:04
 
Personal signature

在爱好的道路上不断前进,在生活的迷雾中播撒光引

 

6742

Posts

2

Resources
3
 
Qintianqintian0303 posted on 2024-6-20 16:38 The comprehensiveness and representativeness of training data are very important. Image recognition is okay. There are many ways to obtain pictures. If you can't do it, just take them yourself. The sensors are abstract...

The sensor seems to be able to collect data only by itself, but the amount seems to be huge.

This post is from Embedded System
 
 
 

1106

Posts

1

Resources
4
 

Crawler? Or just use Feifei's ImageNet?

This post is from Embedded System

Comments

The latter is? I am ignorant [:sad:]  Details Published on 2024-6-20 21:42
 
 
 

6742

Posts

2

Resources
5
 
Crawler? Or use Feifei's ImageNet directly?
Which is the latter? I am ignorant
This post is from Embedded System

Comments

Fei-Fei Li, a beautiful Chinese-American Computer Vision scientist and a well-known scholar, was once the vice president of Google.   Details Published on 2024-6-21 14:12
 
 
 

209

Posts

1

Resources
6
 

I would also like to ask what is Feifei's ImageNet? How to use it? What are the advantages?

This post is from Embedded System

Comments

Just check ImageNet and you will know... Very well-known in the CV field  Details Published on 2024-6-21 14:13
 
 
 

6742

Posts

2

Resources
7
 

I was wondering, can we use AI to help us collect images or produce sensor data?

This post is from Embedded System
 
 
 

7422

Posts

2

Resources
8
 

It depends on what data you want. Many places in Damei will provide it, but I’m not sure about Dongda.

For example, you can go to their official website to download meteorological data for previous years, for example, for economics, there are many, for example, for diseases, you can go to their official medical website. And so on.

There are also various AI competition official websites and forums, some of which provide data.

This post is from Embedded System

Comments

The suggestion of an official website and forum for AI competitions is a good one. Indeed, many competitions now have data sets.  Details Published on 2024-6-21 11:38
 
Personal signature

默认摸鱼,再摸鱼。2022、9、28

 
 

6742

Posts

2

Resources
9
 
freebsder posted on 2024-6-21 09:46 It depends on what data you want. Many places in Damei will provide it, but I am not sure about Dongda. For example, you can go to their official website to download the meteorological data of previous years...

The suggestion of an official website and forum for AI competitions is a good one. Indeed, many competitions now have data sets.

This post is from Embedded System
 
 
 

1106

Posts

1

Resources
10
 
wangerxian posted on 2024-6-20 21:42 The latter is? I am ignorant

Fei-Fei Li, a beautiful Chinese-American Computer Vision scientist and a well-known scholar...

Former Vice President of Google

This post is from Embedded System
 
 
 

1106

Posts

1

Resources
11
 
851779592 Published on 2024-6-21 09:11 I would like to ask what is Feifei's ImageNet? How to use it? What are the advantages?

Just check ImageNet and you will know... Very well-known in the CV field

This post is from Embedded System
 
 
 

209

Posts

1

Resources
12
 

OK, thank you! I have checked the relevant information, it is really great! Very good! Thank you for your recommendation and explanation!

This post is from Embedded System
 
 
 

4764

Posts

12

Resources
13
 

Basically it's free, a lot of datasets

If you want to do it yourself, you have to clean the pictures and label them, which is quite troublesome.

I have used Maix Hub and it is quite good, suitable for embedded devices

This post is from Embedded System

Comments

I think that going through this process is the only way to truly train the AI model.  Details Published on 2024-6-24 15:21
 
 
 

6742

Posts

2

Resources
14
 
Azuma Simeng posted on 2024-6-24 10:51 Basically it’s free. If you want to do a lot of datasets by yourself, you have to clean the pictures and annotate them, which is quite troublesome. I’ve used MaixHub and it’s pretty good, ...

I think that going through this process is the only way to truly train the AI model.

This post is from Embedded System

Comments

I don't quite agree. Preparing data sets is actually a boring and repetitive task. It's meaningless. The core of AI is the algorithm, which is how to make the recognition matrix. Including how to reduce noise, various transformations after noise reduction, and how to extract edges, feature points, etc., which are studied by the big guys.   Details Published on 2024-6-24 16:38
I don't quite agree. Preparing data sets is actually a boring and repetitive task. It's meaningless. The core of AI is the algorithm, which is how to make the recognition matrix. Including how to reduce noise, various transformations after noise reduction, and how to extract edges, feature points, etc., which are studied by the big guys.   Details Published on 2024-6-24 16:37
 
 
 

4764

Posts

12

Resources
15
 
wangerxian posted on 2024-6-24 15:21 I think that only by going through this process can we really train the AI model.

I don't quite agree.

Preparing the data set is actually tedious and repetitive work, which is meaningless.

The core of AI is the algorithm, which is how to create the recognition matrix.

Including how to reduce noise, various transformations after noise reduction, taking edges, taking feature points, etc., which are studied by the big guys.

This post is from Embedded System
 
 
 

4764

Posts

12

Resources
16
 
wangerxian posted on 2024-6-24 15:21 I think that only by going through this process can we really train the AI model.

As for making training sets and test sets, and annotating and scaling images, these are all done by interns.

This post is from Embedded System
 
 
 

Guess Your Favourite
Just looking around
Find a datasheet?

EEWorld Datasheet Technical Support

EEWorld
subscription
account

EEWorld
service
account

Automotive
development
circle

Copyright © 2005-2024 EEWORLD.com.cn, Inc. All rights reserved 京B2-20211791 京ICP备10001474号-1 电信业务审批[2006]字第258号函 京公网安备 11010802033920号
快速回复 返回顶部 Return list