httpjames

joined 1 year ago
[–] [email protected] 13 points 2 months ago

Spam 1 if we should be worried

 

Each LLM is given the same 1000 chess puzzles to solve. See puzzles.csv. Benchmarked on Mar 25, 2024.

Model Solved Solved % Illegal Moves Illegal Moves % Adjusted Elo
gpt-4-turbo-preview 229 22.9% 163 16.3% 1144
gpt-4 195 19.5% 183 18.3% 1047
claude-3-opus-20240229 72 7.2% 464 46.4% 521
claude-3-haiku-20240307 38 3.8% 590 59.0% 363
claude-3-sonnet-20240229 23 2.3% 663 66.3% 286
gpt-3.5-turbo 23 2.3% 683 68.3% 269
claude-instant-1.2 10 1.0% 707 66.3% 245
mistral-large-latest 4 0.4% 813 81.3% 149
mixtral-8x7b 9 0.9% 832 83.2% 136
gemini-1.5-pro-latest* FAIL - - - -

Published by the CEO of Kagi!

[–] [email protected] 20 points 9 months ago (1 children)

I'm part of the Ente team. Thanks for letting us know. I've passed this along.

[–] [email protected] 51 points 10 months ago (6 children)

Reboot mid flight is a funny solution

[–] [email protected] 5 points 10 months ago

Joins must be a pain in the ass with hooks

[–] [email protected] 11 points 11 months ago* (last edited 11 months ago)

I've spent over 2,000 hours on YouTube this year alone and am in the target demographic of this study. I watch a lot of videos in the background while I work, commute, or just chill, to keep myself stimulated.

Although not all of the content I watch is necessarily educational, a grand majority of it is. Whenever there's a science video in my feed, I'll probably click it. I'm subscribed to Veritasum, TED, Vox, No Boilerplate, etc.

[–] [email protected] 5 points 11 months ago (2 children)

The person on the right of the VPN image is the destination server

[–] [email protected] 2 points 11 months ago

They partnered with Anthropic and that seems to be going, fine I guess? But Anthropic's models definitely need work.

[–] [email protected] 3 points 11 months ago (5 children)

Most come with DNS blocklists now that can prevent you from accessing it

[–] [email protected] 2 points 1 year ago

GPT 4 Turbo is actually much better than GPT 3.5 and 4 for coding. It has a way better understanding of design now.

[–] [email protected] 2 points 1 year ago

As much as I'd love to see them back in OpenAI, I don't think Emmett Shear will give up.

I have a soft spot for Greg since he was the one who introduced the world to GPT 4 on that developer livestream

[–] [email protected] 5 points 1 year ago (1 children)

JSON.stringify(joke)

You have to tell us what it was now 😏

 
 

I'm paying for a VPN service that has a limited number of concurrent devices but I want to use it on all of my devices. Is there a way to self-host a Wireguard VPN on my Linux server that will forward all WAN traffic to my third-party VPN provider? Ideally, I would generate a Wireguard config for this gateway, and all my devices would connect to my local VPN gateway server, thus allowing me to share that one config across all devices.

My router does not support VPN configuration and modifying its firmware is not an option.

67
submitted 1 year ago* (last edited 1 year ago) by [email protected] to c/[email protected]
 

This YouTuber went to Japan to travel for free by abusing people's hospitality and committing fare theft on public transportation.

 

Right now I’ve been using Tailscale because it automatically adapts to my network conditions. If I’m at home, it’ll prioritize local network connection, but when I’m out and about, it’ll automatically beam a direct connection or use a relay.

One gripe I have about it is I can’t run it alongside my normal VPNs on my mobile devices. I have to choose between one or the other.

I have tried Cloudflare Tunnel before, but using it for streaming, like Jellyfin, is forbidden. There’s also the added latency and slowness to having to hop through multiple DCs to reach Cloudflare and back.

view more: next ›