xcjs

Advice - Getting started with LLMs

I'm new to the field of large language models (LLMs) and I'm really interested in learning how to train and use my own models for qualitative analysis. However, I'm not sure where to start or what resources would be most helpful for a complete beginner....

xcjs , 1 month ago

No offense intended, but are you sure it's using your GPU? Twenty minutes is about how long my CPU-locked instance takes to run some 70B parameter models.

On my RTX 3060, I generally get responses in seconds.

xcjs , 1 month ago

Unfortunately, I don't expect it to remain free forever.

xcjs , 1 month ago

Ok, so using my "older" 2070 Super, I was able to get a response from a 70B parameter model in 9-12 minutes. (Llama 3 in this case.)

I'm fairly certain that you're using your CPU or having another issue. Would you like to try and debug your configuration together?
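
As a first debugging step, here is a rough Python sketch for checking whether the GPU is doing anything while the model generates. It assumes `nvidia-smi` is on the PATH and reports plain numeric values; the helper names and the 1 GiB "idle" threshold are made up for illustration, not taken from any tool.

```python
import subprocess

# Fields requested from nvidia-smi, in this order.
QUERY = "name,memory.used,memory.total,utilization.gpu"


def parse_gpu_stats(csv_text):
    """Parse `nvidia-smi --format=csv,noheader,nounits` output into dicts."""
    gpus = []
    for line in csv_text.strip().splitlines():
        name, used, total, util = [field.strip() for field in line.split(",")]
        gpus.append({
            "name": name,
            "memory_used_mib": int(used),
            "memory_total_mib": int(total),
            "utilization_pct": int(util),
        })
    return gpus


def read_gpu_stats():
    """Run nvidia-smi once and return the parsed stats (needs the NVIDIA driver)."""
    result = subprocess.run(
        ["nvidia-smi", f"--query-gpu={QUERY}", "--format=csv,noheader,nounits"],
        capture_output=True, text=True, check=True,
    )
    return parse_gpu_stats(result.stdout)


def looks_idle(gpu, min_used_mib=1024):
    """Heuristic with an assumed threshold: loaded model layers need far more than ~1 GiB."""
    return gpu["memory_used_mib"] < min_used_mib
```

Call `read_gpu_stats()` while a prompt is running. If VRAM use and utilization both stay near zero, the model is probably running on the CPU.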

xcjs , 1 month ago

It should be split between VRAM and regular RAM, at least if it's a GGUF model. Maybe it's not, and that's what's wrong?
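
To make the VRAM/RAM split concrete, here is a back-of-the-envelope sketch. It assumes every layer is the same size and that about 1 GB of VRAM is held back for context and scratch buffers; both are simplifications, and the numbers below are hypothetical rather than measured. Runners built on llama.cpp usually take the resulting count through an option such as `--n-gpu-layers`, and whatever does not fit stays in regular RAM.

```python
import math


def estimate_gpu_layers(model_size_gb, n_layers, vram_gb, reserve_gb=1.0):
    """Estimate how many layers fit in VRAM, assuming all layers are equal in size."""
    per_layer_gb = model_size_gb / n_layers
    usable_gb = max(vram_gb - reserve_gb, 0.0)
    return min(n_layers, math.floor(usable_gb / per_layer_gb))


# Hypothetical numbers: a 40 GB quantized file with 80 layers on an 8 GB card.
layers = estimate_gpu_layers(model_size_gb=40.0, n_layers=80, vram_gb=8.0)
print(layers)  # 14
```

With these inputs only 14 of 80 layers land on the GPU, which is why a 70B model is still slow on a consumer card even when offloading works.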

xcjs , 1 month ago

Good luck! I'm definitely willing to spend a few minutes offering advice/double checking some configuration settings if things go awry again. Let me know how things go. :-)

xcjs , 1 month ago (edited 1 month ago)

I think there was a special process to get Nvidia working in WSL. Let me check... (I'm running natively on Linux, so my experience doing it with WSL is limited.)

https://docs.nvidia.com/cuda/wsl-user-guide/index.html - I'm sure you've followed this already, but according to this guide, you shouldn't install the Nvidia Linux drivers inside WSL; install only the cuda-toolkit metapackage. I'd follow the instructions from that link closely.

You may also run into performance issues within WSL due to the virtual machine overhead.
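
If it helps, here is a small Python sketch for sanity-checking the WSL side. It assumes the WSL kernel reports "microsoft" in `/proc/version` and that the Windows driver exposes its CUDA library under `/usr/lib/wsl/lib`; both match my understanding of how WSL 2 is set up, but treat the path and the function names as assumptions to verify against the guide.

```python
from pathlib import Path


def is_wsl_kernel(version_text):
    """WSL kernels usually identify themselves with 'microsoft' in /proc/version."""
    return "microsoft" in version_text.lower()


def find_wsl_cuda_stub(lib_dir="/usr/lib/wsl/lib"):
    """Return paths of libcuda files in the (assumed) WSL driver directory."""
    directory = Path(lib_dir)
    if not directory.is_dir():
        return []
    return sorted(str(p) for p in directory.glob("libcuda.so*"))


def report():
    """Print a two-line summary of the WSL and CUDA library checks."""
    version_file = Path("/proc/version")
    version = version_file.read_text() if version_file.exists() else ""
    print("Running under WSL:", is_wsl_kernel(version))
    print("libcuda found:", find_wsl_cuda_stub() or "none")
```

Running `report()` inside the WSL shell should show whether the driver library from Windows is visible. If nothing is found, the Windows-side driver is the first thing I'd look at.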

xcjs , 1 month ago

We all mess up! I hope that helps - let me know if you see improvements!

maegul , 1 month ago to Fediverse

Nice demonstration of why Mastodon's dominance is problematic

See the conversations here:
https://github.com/LemmyNet/lemmy/pull/4628
and
https://socialhub.activitypub.rocks/t/federating-the-content-of-posts-note-articles-and-character-limits/4087

AFAICT, Mastodon's decisions, which are arguably problematic (see https://lemmy.ml/post/14973403), are trickling down to other platforms and infecting how they federate with each other as they dance around Mastodon's quirks in different ways.

It seems like masto is ruining "the standard" with its gravity.

#fediverse #mastodon
@fediverse

xcjs , 1 month ago

It's a W3C-managed standard, but there is a lot of behavior not spelled out in the specification that platforms can choose to impose.

The standard doesn't impose a 500 character limit, but there's nothing that says there can't be a limit.
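
As an illustration of the kind of choice a platform is free to make, here is a hypothetical Python sketch of a receiving server enforcing its own length limit on incoming post text. The function name, the 500-character default, and the truncate-with-marker behavior are all assumptions for the example; this is not how Mastodon, Lemmy, or any other platform actually handles it.

```python
def fit_to_limit(content, limit=500, marker="..."):
    """Return content unchanged if it fits, else cut it so the result is `limit` characters."""
    if len(content) <= limit:
        return content
    if limit <= len(marker):
        # Too short for the marker to fit, so just cut hard.
        return content[:limit]
    return content[: limit - len(marker)] + marker
```

Another platform could just as validly reject the post, link to the original, or show it in full, and that variation is where federation gets messy.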
