There was a post asking people for their opinions about Edge, and many people seemed to like the idea of Edge and seemed to be OK having it on Linux (Blasphemy)

Also, can we all agree on how fast Edge went from a joke to a threat? I mean, it’s good now, alright! It was good back then, but it’s better now. Money, man!!! Money! Personally I hate MS, but I can’t help but see the fact that there is no alternative to Bing GPT and many features Bing offers on Linux.

If there were an open source ChatGPT, what would it look like? Who would bear the costs? How would we solve the server problem? I.e., it would take a ton of server space and bandwidth. Just wondering.

I am pretty sure MS products will improve greatly due to their integration with GPT, so what do us poor folks on Linux do?

Just want to know the answers; I don’t want to discuss (aka I can’t comment, I need to study). Just curious!

  • db0
    21
    edit-2
    2 years ago

    As others said, you can host KoboldAI on your own, and if you don’t have a powerful GPU, anyone can use powerful LLM models via the AI Horde
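
    A hedged sketch of that flow in Python with requests: submit a generation job to the horde, then poll for the result. The endpoint paths, payload fields, and the anonymous “0000000000” key are written from memory of the v2 API, so treat them as assumptions and check the docs at https://aihorde.net:

    import time
    import requests

    API = "https://aihorde.net/api/v2"
    HEADERS = {"apikey": "0000000000"}  # anonymous, low-priority key (assumed)

    # Submit an async text generation job to the swarm.
    job = requests.post(
        f"{API}/generate/text/async",
        headers=HEADERS,
        json={"prompt": "Write a haiku about a penguin",
              "params": {"max_length": 80}},
    ).json()

    # Poll until a horde worker picks the job up and finishes it.
    while True:
        status = requests.get(f"{API}/generate/text/status/{job['id']}").json()
        if status.get("done"):
            print(status["generations"][0]["text"])
            break
        time.sleep(2)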

  • @[email protected]
    4
    2 years ago

    At work today, I just heard the term “Federated Machine Learning”, where individual corporations keep their proprietary training data, but make the models shareable. I wonder if something similar could be done with ActivityPub?

    (Feel free to steal this idea if it’s worth stealing)
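
    As a toy illustration of the averaging idea, a minimal sketch in Python/numpy (the linear model and every name here are made up for the example, not a real framework): each node trains on its private data, and only the weight vectors travel.

    import numpy as np

    rng = np.random.default_rng(0)
    true_w = np.array([2.0, -1.0])

    def make_private_dataset():
        # Each "corporation" keeps X and y to itself.
        X = rng.normal(size=(100, 2))
        return X, X @ true_w + rng.normal(scale=0.1, size=100)

    def local_update(weights, private_data, lr=0.1):
        # One least-squares gradient step on data that never leaves the node.
        X, y = private_data
        grad = X.T @ (X @ weights - y) / len(y)
        return weights - lr * grad

    def federated_round(global_weights, nodes):
        # Only the updated weights are pooled and averaged.
        return np.mean([local_update(global_weights, d) for d in nodes], axis=0)

    nodes = [make_private_dataset() for _ in range(3)]
    w = np.zeros(2)
    for _ in range(100):
        w = federated_round(w, nodes)
    print(w)  # ends up near true_w without any raw data being shared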

    • @[email protected]
      English
      6
      2 years ago

      This doesn’t work when an important part of the process is making sure your data is actually good. If the data is proprietary, there is no way to make sure it is usable.

  • @[email protected]
    2
    2 years ago

    Like, have you even tried a search, or even attempted to look up anything? There is a lot. Most of it boils down to: stay under the free limits, start paying for compute, or run your own. It’s actually very trivial to run a slightly less effective model on a 4090. Like, you can be up and running inside a few minutes if you already know pip.

    • @[email protected]
      13
      2 years ago

      Such a wildly helpful comment. Keep on perpetuating the type of attitude that this community is known for!

  • @[email protected]
    13
    edit-2
    2 years ago

    petals.dev: it’s basically BitTorrent for AI, a public swarm for running and training LLMs.
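
    A hedged sketch of what using it looks like in Python, close to the example the Petals README gave around that time (the model name is the one they published then; the public swarm may serve different models now):

    from transformers import AutoTokenizer
    from petals import AutoDistributedModelForCausalLM

    model_name = "bigscience/bloom-petals"
    tokenizer = AutoTokenizer.from_pretrained(model_name)
    # Only a small part loads locally; transformer blocks run on remote peers.
    model = AutoDistributedModelForCausalLM.from_pretrained(model_name)

    inputs = tokenizer("A penguin walks on ice", return_tensors="pt")["input_ids"]
    outputs = model.generate(inputs, max_new_tokens=20)
    print(tokenizer.decode(outputs[0]))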

  • Infiltrated_ad8271
    6
    2 years ago

    For images there are FOSS options that you can run locally, like Stable Diffusion, which is so good that it rivals its proprietary counterparts.
    But for text it’s a horror. There are some you can try (see gpt4all; there’s a sketch at the end of this comment), but in general ChatGPT has no real competition; the FOSS options are currently very bad, and even the proprietary options from big corporations, like Bard or LLaMA, are pitiful.

    but I can’t help but see the fact that there is no alternative to Bing GPT

    Regarding this, there are other services that also use GPT for search; here are a couple:

    https://www.perplexity.ai/

    https://www.phind.com/
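
    And for the gpt4all route mentioned above, a minimal sketch of its Python bindings; the model file name is an assumption (one of the small models the project shipped), and it downloads automatically on first use:

    from gpt4all import GPT4All

    # Downloads the model file on first run (a few GB); name is an example.
    model = GPT4All("orca-mini-3b.ggmlv3.q4_0.bin")
    print(model.generate("Write a haiku about a penguin", max_tokens=64))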

  • @[email protected]
    32
    edit-2
    2 years ago

    what do us poor folks on Linux do?

    Run llama.cpp and any of the models listed here; that stuff has been around for months.

    TheBloke has a lot of models converted to GGUF format, which you need for llama.cpp.

    Quick Start Guide (requires Nix, otherwise compile llama.cpp manually):

    $ GIT_LFS_SKIP_SMUDGE=1 git clone https://huggingface.co/TheBloke/guanaco-7B-GGUF
    $ cd guanaco-7B-GGUF
    $ git lfs pull --include=Guanaco-7B.Q4_0.gguf
    $ nix run github:ggerganov/llama.cpp -- -m Guanaco-7B.Q4_0.gguf --instruct
    > Write haiku about a penguin
     A penguin walks on ice,
     Takes a plunge in the sea,
     Hides his feet from me!
    
    • 257m
      1
      2 years ago

      I ran it on my PC with a GTX 1070 with CUDA enabled and compiled with the CUDA compile hint, but it ran really slowly. How do you get it to run fast?

      • @[email protected]
        1
        2 years ago

        To make use of GPU acceleration you have to compile it with the proper support (CUDA, OpenCL, ROCm) and add --gpu-layers 16 (or a larger number, however much your VRAM can handle). If that’s not enough, then the GPU/CPU is probably too slow.

        You can try a smaller model; those run faster, but give worse results.
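
        If you’d rather script it, the llama-cpp-python bindings expose the same knob; a hedged sketch, assuming a 2023-era install built with CUDA (the CMake flag and the model path are examples, not gospel):

        # Build the bindings with CUDA support first, e.g.:
        #   CMAKE_ARGS="-DLLAMA_CUBLAS=on" pip install llama-cpp-python
        from llama_cpp import Llama

        llm = Llama(
            model_path="Guanaco-7B.Q4_0.gguf",  # any GGUF model file
            n_gpu_layers=16,  # raise until you run out of VRAM
        )
        out = llm("Write a haiku about a penguin", max_tokens=64)
        print(out["choices"][0]["text"])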

  • The Doctor
    English
    19
    2 years ago

    There are quite a few of them out there. Just a few from my notes (in reverse chronological order)

    Self-hosted AI/ML systems have been around for a while. Granted, the vast majority of them require downloading pre-trained models before they can be used, due to how much it costs to build a system of weights from scratch.

    I sometimes wonder if it would be possible to build and train a truly open source model with BOINC or something. The last 30 years of history show that it’s entirely feasible to build a massively distributed computing cluster, so why not leverage this to build a model? I know how naive that sounds immediately after writing it, mostly because of the difficulty of getting a large enough training data set, which unfortunately has risk written all over it (read: people poisoning the model, à la Microsoft’s experiment with Tay on birbsite some years back).

  • @[email protected]
    20
    2 years ago

    i don’t know a single thing about whatever these things you’re talking about are, but i just want to say: you’re the first person i’ve ever seen suggest that edge isn’t a joke.

  • @[email protected]
    English
    9
    2 years ago

    I think it’s far more likely that easier solutions for hosting your own LLM/ChatGPT/etc. will be pushed, rather than someone (or a group of someones) offering and maintaining one; as you mentioned, that takes a lot of resources. Think about the issues various Fediverse software has had keeping up with load at times, and LLMs use far more resources than most Fediverse applications, especially if you want them to be as snappy as the ones already out there.