Questions tagged with Image Processing & Analytics
Sort by most recent
Browse through the questions and answers listed below or filter and sort to narrow down your results.
Is one supposed to grayscale and brightness contrast process the image before sending to textract?
Textract results on recognizing basic arithmetic seems to degrade with color This series of images show Textract **failing unusually in all cases except the one** where the image has been both grayscale and brightness/contrast (50/50 and 25/25) - [unedited image from the camera](https://i.gyazo.com/0c6d8126dff5269dbe089a090a9e9d26.png) FAIL - [brightness contrast applied without grayscale](https://gyazo.com/8e3cc523552449ff54b9ed8fdbe6594f) FAIL - [grayscale](https://gyazo.com/8e3cc523552449ff54b9ed8fdbe6594f) FAIL - [grayscale with brightness contrast] (https://i.gyazo.com/22269131293c8e7b50c7aee7b998554c.png) finally! Is one supposed to grayscale the image before sending to textract? Should one also apply brightness/contrast? I assume Textract was trained with grayscale images - so should the service automatically convert the input images to grayscale?
AI used for recognizing emojis on wallpaper
I want to build an app that will recognize what emojis have been used on the wallpaper. So for instance this app will receive an input image like [this](https://i.stack.imgur.com/5wRQm.png) And on output should return an array of names of recognized emojis: ``` [ "Smiling Face with Sunglasses", "Grinning Face with Smiling Eyes", "Kissing Face with Closed Eyes" ] ``` Of course, the names of these emojis will come from the names of files of training images. For example, [this](https://i.stack.imgur.com/BaEGG.png) file, will be called `Grinning_Face_with_Smiling_Eyes.jpg` I would like to use `AWS Rekognition Label`, but they require a minimum of 10 images of each emoji for training. As you know, I can only provide one image of each emoji, because there is no more option, they are in 2D ;) Now my question is: What should I do? How can I skip these requirements? Which service should I choose? On the stack, I have read that I should for instance rotate each image 12 times by 30o, or crop the emoji by half. I have done that, but, precision is very small - around `0.3` PS. In real business instead of emojis, there are covers of the books, which AI has to recognize. There is also one image per book-cover photo in 2D.