this post was submitted on 13 Nov 2023
-6 points (44.4% liked)

Technology

59207 readers
3474 users here now

This is a most excellent place for technology news and articles.


Our Rules


  1. Follow the lemmy.world rules.
  2. Only tech related content.
  3. Be excellent to each another!
  4. Mod approved content bots can post up to 10 articles per day.
  5. Threads asking for personal tech support may be deleted.
  6. Politics threads may be removed.
  7. No memes allowed as posts, OK to post as comments.
  8. Only approved bots from the list below, to ask if your bot can be added please contact us.
  9. Check for duplicates before posting, duplicates may be removed

Approved Bots


founded 1 year ago
MODERATORS
you are viewing a single comment's thread
view the rest of the comments
[–] [email protected] 10 points 1 year ago (18 children)

Ok, if you take any book, keep it closed, how many times do the letters s, q, d and r appear in the book?

There is no way to know without opening the book and counting, sure, you could make some statisticsl analysis based on the language used, but that doesn't take into account the font size and spacing, nor the number of pages.

Since the machine only has a photo to analyze, it can only give extremely generic results, making them effectively useless.

You would need to open the food up and actually analyze a part of the inside with something like a mass spectrometer to get any useful data.

[–] [email protected] 7 points 1 year ago* (last edited 1 year ago) (9 children)

I agree with you, but disagree with your reasoning.

If you take 1lb of potatoes, boil and mash them with no other add-ins, you can reasonably estimate the nutritional information through visual inspection alone, assuming you have enough reference to see there is about a pound of potatoes. There are many nutrition apps out there that utilize this, and it’s essentially just lopping off the extremes and averaging out the rest.

The problem with this is, it’s impossible to accurately guess the recipe, and therefore the ingredients. Take the aforementioned mashed potatoes. You can’t accurately tell what variety of potato was used. Was water added back during the mashing? Butter? Cream cheese? Cheddar? Sour cream? There’s no way to tell visually, assuming uniform mashing, what is in the potatoes.

Not to mention, the pin sees two pieces of bread on top of each other… what is in the bread? Who the fuck knows!

[–] [email protected] 0 points 1 year ago (7 children)

I see what you mean, and while you raise a few excellent points, you seem to forget that a human looking at mashed potatoes have far more data than a computer lookkng at an image.

A human get data about smell, temperature texture and weight in addition to a simple visual impression.

This is why I picked a book/letter example, I wanted to reduce the variables available to a human to get closer to what a computer has from a photo.

[–] [email protected] 7 points 1 year ago* (last edited 1 year ago) (1 children)

It needn’t be exact. A ballpark calorie/sugar that’s 90% accurate would be sufficient. There’s some research that suggests that’s possible: https://arxiv.org/pdf/2011.01082.pdf

[–] [email protected] 1 points 1 year ago (1 children)

But what use would it be then, you wouldn't be able to compare one potato to another, both would register the same values.

[–] [email protected] 3 points 1 year ago (1 children)

I think the use case is not people doing potato study but people that want to lose weight and need to know the amount of calories in the piece of cake that’s offered at the office cafeteria.

[–] [email protected] 1 points 1 year ago (1 children)

And that means the feature is useless, there are so many things in a cake that can't be seen from a simple picture.

And if it is just a generic "cake" value, it will show incorrect data

[–] [email protected] 2 points 1 year ago

The paper I showed earlier disagrees

load more comments (5 replies)
load more comments (6 replies)
load more comments (14 replies)