Looking for a video on quicksync performance impact of iGPU passthrough

The Hobbyist@lemmy.zip · 17 days ago

It’s on the very first page, opposite the office server page, and they acknowledge that the author does not exist and that it’s basically an ad for Windows Server.

The Hobbyist@lemmy.zip · edit-2 2 months ago

@demigodrick@lemmy.zip

Perhaps of interest? I don’t know how many bots you’re facing.

The Hobbyist@lemmy.zip · edit-2 2 months ago

I feel you are a bit out of touch, given that the topic is specifically enshittification, which is based on the history of companies turning against their users and showing little good faith. Open source projects are not spared either (remember Bitwarden’s attempt?). So sure, I’m not going to deny that I’m making assumptions and that I am concerned it may one day happen. But it is grounded in reality, not some tinfoil hat stuff.

Edit: and the fact that bitwarden did not eventually go through with it does not counter the fact that they intended to and tried. Sometimes companies back off and play the long game and try to be more subtle about it.

The Hobbyist@lemmy.zip · edit-2 2 months ago

Tailscale has an employee who is contributing to headscale. I think this is helpful, but they could decide to stop this collaboration the moment they feel it is counterproductive. They may also decide to start adding undocumented/proprietary/“secure” elements which prevent headscale from working.

There is no guarantee headscale can keep working the way it does or that it is allowed to keep existing.

Edit: FYI headscale is not at all at feature parity with what tailscale offers.

The Hobbyist@lemmy.zip · edit-2 3 months ago

Congrats! Amazing project, exciting interface and you went the extra mile on the integration side with third parties. Kudos!

Edit: I’ll definitely have to try it out!

The Hobbyist@lemmy.zip · 3 months ago

Perhaps give Ramalama a try?

https://github.com/containers/ramalama

The Hobbyist@lemmy.zip · 3 months ago

Indeed, Ollama is going a shady route. https://github.com/ggml-org/llama.cpp/pull/11016#issuecomment-2599740463

I started playing with Ramalama (the name is a mouthful) and it works great. There are one or two more steps in the setup but I’ve achieved great performance and the project is making good use of standards (OCI, jinja, unmodified llama.cpp, from what I understand).

Go and check it out, they are compatible with models from HF and Ollama too.

https://github.com/containers/ramalama

The Hobbyist@lemmy.zip · 7 months ago

Would you be able to share more info? I remember reading their issues with docker, but I don’t recall reading about whether or what they switched to. What is it now?

The Hobbyist@lemmy.zip · edit-2 8 months ago

Regarding photos, and videos specifically:

I know you said you are starting with selfhosting so your question was focusing on that, but I would like to also share my experience with ente which has been working beautifully for my family, partner and myself. They are truly end to end encrypted, with the source code available on github.

They have reasonable prices. If you feel adventurous you can actually also host it yourself. They have advanced search features and face recognition which all run on device (since they can’t access your data) and it works very well. They have great sharing and collaborating features and don’t lock features behind accounts so you can actually gather memories from people on your quota by just sharing a link. You can also have a shared family plan.

The Hobbyist@lemmy.zip · edit-2 8 months ago

Ollama is very useful but also rather barebones. I recommend installing Open-Webui to manage models and conversations. It will also be useful if you want to tweak more advanced settings like system prompts, seed, temperature and others.

You can install open-webui using docker or just pip, which is enough if you only care about serving yourself.

Edit: open-webui also renders markdown, which makes formatting and reading much more appealing and useful.

Edit2: you can also plug ollama into continue.dev, an extension to vscode which brings the LLM capabilities to your IDE.
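For anyone curious what settings like system prompt, seed and temperature look like underneath a UI, here is a minimal sketch of a request to Ollama's local REST API. It assumes Ollama is running on its default port (11434) and that a model called `mistral-nemo` has already been pulled; the model name and the helper names `build_request` and `generate` are only illustrations, not part of any library.

```python
import json
import urllib.request

# Assumption: Ollama is running locally on its default port.
OLLAMA_URL = "http://localhost:11434/api/generate"


def build_request(model, prompt, system=None, seed=None, temperature=None):
    """Build the JSON payload for a single, non-streaming generation."""
    payload = {"model": model, "prompt": prompt, "stream": False}
    if system is not None:
        payload["system"] = system  # overrides the model's default system prompt
    options = {}
    if seed is not None:
        options["seed"] = seed  # fixed seed for more repeatable output
    if temperature is not None:
        options["temperature"] = temperature  # lower values give less random output
    if options:
        payload["options"] = options
    return payload


def generate(payload, url=OLLAMA_URL):
    """Send the payload to Ollama and return the generated text."""
    request = urllib.request.Request(
        url,
        data=json.dumps(payload).encode("utf-8"),
        headers={"Content-Type": "application/json"},
    )
    with urllib.request.urlopen(request) as response:
        return json.loads(response.read())["response"]
```

With a server running, something like `generate(build_request("mistral-nemo", "Explain ZFS snapshots briefly.", system="Be concise.", seed=42, temperature=0.2))` should return the reply as a string. Open-Webui exposes the same knobs in its interface, so this is only useful if you want to script things.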

The Hobbyist@lemmy.zip · 9 months ago

Seems the chapter for Jellyfin has been “coming soon” for 3 years, too bad.

https://docs.ombi.app/settings/jellyfin/

The Hobbyist@lemmy.zip · edit-2 9 months ago

I’m not saying it’s not true, but nowhere on that page is there the word donation. And if it is one, the fact that it is described as a license, tied to a server or a user, causes me a lot of confusion, especially when combined with the fact that there is no paywall but that it requires registration.

Why use the terms license, server and user? Why not simply say donation, with the option of displaying the support by getting exclusive access to a badge like Signal does?

Again, I’m very happy immich is free, it is great software and it deserves support but this is just super confusing to me and the buy.immich.app link does not clarify things nor does that blog post.

Edit: typo

The Hobbyist@lemmy.zip · 9 months ago

Hi and thank you so much for the fantastic work on Immich! I’m hoping to get a chance to try it out soon, with the first stable release!

One question on the financial support page: is it not a donation? There is a per server and a per user purchase, but I thought immich was exclusively self hosted, is it not? Or is this more like a way to say thanks while giving some hints as to how immich is being used privately? Or is there a way to actually pay to have immich host a server for one?

Thanks for clarifying!

The Hobbyist@lemmy.zip · 10 months ago

This is the way.

The Hobbyist@lemmy.zip · 10 months ago

I hear you, but how much time was Synology given? If it was no time at all (which it seems is what happened here??), that does not even give Synology a chance and that’s what I’m concerned with. If they get a month (give or take), then sure, disclose it and too bad for them if they don’t have a fix, they should have taken it more seriously, but I’m wondering about how much time they were even given in this case.

The Hobbyist@lemmy.zip · 10 months ago

Was it that the talk was a last minute change (replacing another scheduled talk) so the responsible disclosure was made in a rush without giving synology more time to provide the patch before the talk was presented?

If so, who decided it was a good idea to present something regarding a vulnerability without the fix being available yet?

The Hobbyist@lemmy.zip · 10 months ago

I’m not sure, I read that ZFS can help in the case of ransomware, so I assumed it would extend to accidental formatting but maybe there’s a key difference.

The Hobbyist@lemmy.zip · edit-2 10 months ago

I think these kinds of situations are where ZFS snapshots shine: you’re back in a matter of seconds with no data loss (assuming you have a recent snapshot before the mistake).

Edit: yeah no, if you operate at the disk level directly, no local ZFS snapshot could save you…

The Hobbyist@lemmy.zip · 11 months ago

I didn’t say it can’t. But I’m not sure how well it is optimized for it. From my initial testing it queues queries and submits them one after another to the model. I have not seen it batch compute the queries, but maybe it’s a setup thing on my side. vLLM on the other hand is designed specifically for the multiple concurrent users use case and has multiple optimizations for it.

The Hobbyist@lemmy.zip · edit-2 11 months ago

I run Mistral-Nemo (12B) and Mistral-Small (22B) on my GPU and they are pretty good. As others have said, GPU memory is one of the most limiting factors. 8B models are decent, 15-25B models are good and 70B+ models are excellent (solely based on my own experience). Go for q4_K models, as they will run many times faster than higher-precision quantizations with little performance degradation. They typically come in S (Small), M (Medium) and L (Large) variants; take the largest which fits in your GPU memory. If you go below q4, you may see more severe and noticeable performance degradation.
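To make "the largest which fits in your GPU memory" a bit more concrete, here is a rough back-of-the-envelope sketch. The bits-per-weight figures are approximations I am assuming for illustration, the 1.5 GB overhead for context and buffers is a guess that depends on context length, and the function names are made up for this example.

```python
# Assumption: approximate average bits per weight for common GGUF quantizations.
# Real files vary by model architecture, so treat these as ballpark values.
APPROX_BITS_PER_WEIGHT = {
    "q4_K_S": 4.58,
    "q4_K_M": 4.85,
    "q8_0": 8.5,
    "f16": 16.0,
}


def estimate_weights_gb(params_billion, quant):
    """Approximate size of the weights in GB (1 GB = 1e9 bytes)."""
    bits = APPROX_BITS_PER_WEIGHT[quant]
    # billions of parameters * bits per weight / 8 bits per byte = GB
    return params_billion * bits / 8


def largest_fitting_quant(params_billion, vram_gb, overhead_gb=1.5):
    """Return the highest-precision quantization that fits, or None.

    overhead_gb is a hypothetical allowance for the KV cache and buffers.
    """
    by_precision = sorted(
        APPROX_BITS_PER_WEIGHT, key=APPROX_BITS_PER_WEIGHT.get, reverse=True
    )
    for quant in by_precision:
        if estimate_weights_gb(params_billion, quant) + overhead_gb <= vram_gb:
            return quant
    return None
```

For example, `largest_fitting_quant(12, 12)` checks a 12B model against a 12 GB card. If nothing fits, the function returns `None`, which in practice means offloading part of the model to CPU or picking a smaller model.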

If you need to serve only one user at a time, ollama + Webui works great. If you need multiple users at the same time, check out vLLM.

Edit: I’m simplifying it very much, but hopefully it is simple and actionable as a starting point. I’ve also seen great stuff from Gemma2-27B.

Edit2: added links

Edit3: a decent GPU regarding bang for buck IMO is the RTX 3060 with 12GB. It may be available on the used market for a decent price and offers a good amount of VRAM and GPU performance for the cost. I would like to propose AMD GPUs, as they offer much more GPU memory for their price, but they are not all equally well supported by ROCm and I’m not sure about their compatibility with these tools, so perhaps others can chime in.

Edit4: you can also use openwebui with vscode with the continue.dev extension such that you can have a copilot type LLM in your editor.

The Hobbyist@lemmy.zip · 2 years ago

Looking for a video on quicksync performance impact of iGPU passthrough

The Hobbyist@lemmy.zip · edit-2 2 years ago

ZFS dataset configuration for a movies and tv shows library? Very heterogeneous data
