Technology

This is a most excellent place for technology news and articles.

Our Rules

Follow the lemmy.world rules.
Only tech-related content.
Be excellent to each other!
Mod-approved content bots can post up to 10 articles per day.
Threads asking for personal tech support may be deleted.
Politics threads may be removed.
No memes allowed as posts; they are OK as comments.
Only approved bots from the list below may post. To ask whether your bot can be added, please contact us.
Check for duplicates before posting; duplicates may be removed.

Approved Bots

founded 1 year ago

Early impressions of Google's Gemini aren't great | TechCrunch (techcrunch.com)

submitted 11 months ago by [email protected] to c/[email protected]

[email protected] 2 points 11 months ago

This is the best summary I could come up with:

Science fiction author Charlie Stross found many more examples of confabulation in a recent blog post.

It seems Gemini Pro is loath to comment on potentially controversial news topics, instead telling users to… Google it themselves.

Interestingly, Gemini Pro did provide a summary of updates on the war in Ukraine when I asked it for one.

Google emphasized Gemini’s enhanced coding skills in a briefing earlier this week.

And, as with all generative AI models, Gemini Pro isn’t immune to “jailbreaks,” i.e., prompts that get around the safety filters meant to keep it from discussing controversial topics.

Using an automated method to algorithmically change the context of prompts until Gemini Pro’s guardrails failed, AI security researchers at Robust Intelligence, a startup selling model-auditing tools, managed to get Gemini Pro to suggest ways to steal from a charity and assassinate a high-profile individual (albeit with “nanobots” — admittedly not the most realistic weapon of choice).

The original article contains 597 words, the summary contains 157 words. Saved 74%. I'm a bot and I'm open source!