Selfhosted
A place to share alternatives to popular online services that can be self-hosted without giving up privacy or locking you into a service you don't control.
Rules:
-
Be civil: we're here to support and learn from one another. Insults won't be tolerated. Flame wars are frowned upon.
-
No spam posting.
-
Posts have to be centered around self-hosting. There are other communities for discussing hardware or home computing. If it's not obvious why your post topic revolves around selfhosting, please include details to make it clear.
-
Don't duplicate the full text of your blog or github here. Just post the link for folks to click.
-
Submission headline should match the article title (don’t cherry-pick information from the title to fit your agenda).
-
No trolling.
Resources:
- selfh.st Newsletter and index of selfhosted software and apps
- awesome-selfhosted software
- awesome-sysadmin resources
- Self-Hosted Podcast from Jupiter Broadcasting
Any issues on the community? Report it using the report flag.
Questions? DM the mods!
view the rest of the comments
Have you found much practical use for small models yet? I love the idea that even the 1.1B tinyllama model can run on my phone, but haven't found much real world use for it yet. Llama3 8b feels better, but not much better for even emails as it's a bit dumb
I use my phone all the time, but I just use a wireguard VPN to tunnel into my home container of Open WebUI. Then I can interact with my desktop machine using a NVIDIA gpu. I'm currently testing mistral-nemo. It's pretty great but it gets a bit verbose sometimes.
I am also using open webui. Most LLMs are too verbose for me, so I created a model in open-webui with system prompt "Do not repeat the questions. Avoid giving lists as answers. Do not summarize the answer at the end. If asked a follow-up question, respond with only new information, do not repeat previously stated information." and named it No Nonsense.
That's really smart. I just found out about fabric yesterday and it is helping me with things like what you stated. Prompt engineering is a huge thing.
for some reason chatgpt responds well to “no yapping”
Imo it's worthwhile to just run the biggest model available and rent expensive GPU time. It still amounts to very little overall and you get much better results. Project dependent of course