this post was submitted on 11 Jun 2024
37 points (93.0% liked)

Android

27546 readers
539 users here now

DROID DOES

Welcome to the droidymcdroidface-iest, Lemmyest (Lemmiest), test, bestest, phoniest, pluckiest, snarkiest, and spiciest Android community on Lemmy (Do not respond)! Here you can participate in amazing discussions and events relating to all things Android.

The rules for posting and commenting, besides the rules defined here for lemmy.world, are as follows:

Rules


1. All posts must be relevant to Android devices/operating system.


2. Posts cannot be illegal or NSFW material.


3. No spam, self promotion, or upvote farming. Sources engaging in these behavior will be added to the Blacklist.


4. Non-whitelisted bots will be banned.


5. Engage respectfully: Harassment, flamebaiting, bad faith engagement, or agenda posting will result in your posts being removed. Excessive violations will result in temporary or permanent ban, depending on severity.


6. Memes are not allowed to be posts, but are allowed in the comments.


7. Posts from clickbait sources are heavily discouraged. Please de-clickbait titles if it needs to be submitted.


8. Submission statements of any length composed of your own thoughts inside the post text field are mandatory for any microblog posts, and are optional but recommended for article/image/video posts.


Community Resources:


We are Android girls*,

In our Lemmy.world.

The back is plastic,

It's fantastic.

*Well, not just girls: people of all gender identities are welcomed here.


Our Partner Communities:

[email protected]


founded 1 year ago
MODERATORS
 

i've been searching for a while but none of the ones i find are great, the one i landed on hasn't been updated in over a year

top 11 comments
sorted by: hot top controversial new old
[–] [email protected] 13 points 3 months ago (2 children)

https://github.com/Akylas/OSS-DocumentScanner

I use this app for scanning documents, I just tried sharing a picture to the app and running OCR on that picture, which also worked fine, so it should also fit your usecase.

[–] [email protected] 5 points 3 months ago

seems promising! thanks

[–] [email protected] 1 points 3 months ago

Idk if my handwriting is messy or what but the ocr comes up pretty terribly when I do it. Still probably the best pdf scanner nonetheless.

[–] [email protected] 6 points 3 months ago (1 children)
[–] [email protected] 5 points 3 months ago (1 children)

mostly i just use it to extract text from screenshots to use in image descriptions

[–] [email protected] 2 points 3 months ago

Like Firefox ScreenshotGo? (I think it only supports English though)

[–] [email protected] 3 points 3 months ago (2 children)

Not particularly Android related but I'm fairly certain you can do some OCR with Python. Question is whether you want to analyse an image file or straight into the camera. The latter might be a challenge.

[–] [email protected] 11 points 3 months ago

i have posted this in an android community because i want to do it with an android app :/
unless there's a way to easily run a python program with all the necessary dependencies on android this does not help me

[–] [email protected] 2 points 3 months ago (1 children)

no need for Python. there’s a Google SDK, ML Kit, that will do the heavy lifting on this. if that’s not acceptable, TensorFlow, PyTorch, and ONNX support Android, albeit not as nicely integrated.

your image processing pipeline will be imageSource -> RGB encoding -> OCR -> profit. your OCR just needs an RGB encoded image. doesn’t matter if that’s a JPEG or YUV video feed at the source.

as for if there’s an app that fits OP’s exact use case, dunno.

[–] [email protected] 2 points 3 months ago (1 children)

Man, you over complicated this task. OP, in F-Droid there are three apps that are based on a popular OCR Python library called tesseract. Just search for this term on F-Droid and give those apps a try.

[–] [email protected] 3 points 3 months ago

i mean, you’re right. i’m just saying it’s a little silly to ship a Python interpreter when there are easier, better supported ways to do the same thing.

looks like tesseract provides C bindings which are probably being utilized in those apps.