Power User
  • Communities
  • Create Post
  • Create Community
  • heart
    Support Lemmy
  • search
    Search
  • Login
  • Sign Up
PookaMacPhellimen@alien.topB to LocalLLaMAEnglish · 1 year ago

Qwen-72B released

huggingface.co

external-link
message-square
39
fedilink
1
external-link

Qwen-72B released

huggingface.co

PookaMacPhellimen@alien.topB to LocalLLaMAEnglish · 1 year ago
message-square
39
fedilink
Qwen/Qwen-72B · Hugging Face
huggingface.co
external-link
We’re on a journey to advance and democratize artificial intelligence through open source and open science.
  • PookaMacPhellimen@alien.topOPB
    link
    fedilink
    English
    arrow-up
    1
    ·
    1 year ago

    https://preview.redd.it/sdofti9odg3c1.jpeg?width=1792&format=pjpg&auto=webp&s=d6f56d56c3596924ea61e1e5429018c0222907d2

    Amazing capabilities on some benchmarks if true.

    • Disastrous_Elk_6375@alien.topB
      link
      fedilink
      arrow-up
      1
      ·
      1 year ago

      big if true

    • a_slay_nub@alien.topB
      link
      fedilink
      English
      arrow-up
      1
      ·
      1 year ago

      Bit disappointed by the coding performance but it is a general use case model. It’s insane how good gpt 3.5 is for how fast it is.

      • ambient_temp_xeno@alien.topB
        link
        fedilink
        English
        arrow-up
        1
        ·
        1 year ago

        Apparently the chat version has about 64 for humaneval.

    • Secret_Joke_2262@alien.topB
      link
      fedilink
      English
      arrow-up
      1
      ·
      1 year ago

      What do these tests mean for LLM? There are many values, and I see that in most cases qwen is better than gpt4. In others it is worse or much worse

      • rileyphone@alien.topB
        link
        fedilink
        English
        arrow-up
        1
        ·
        1 year ago

        All the cases it is better than GPT-4 are benchmarks involving Chinese language. OpenAI is going to have a hard time getting access to extensive Chinese language datasets so it’s not surprising a 72B model can beat GPT-4, though it’s still impressive in it’s own right.

LocalLLaMA

localllama

Subscribe from Remote Instance

Create a post
You are not logged in. However you can subscribe from another Fediverse account, for example Lemmy or Mastodon. To do this, paste the following into the search field of your instance: !localllama@poweruser.forum

Community to discuss about Llama, the family of large language models created by Meta AI.

Visibility: Public
globe

This community can be federated to other instances and be posted/commented in by their users.

  • 4 users / day
  • 4 users / week
  • 4 users / month
  • 4 users / 6 months
  • 1 local subscriber
  • 5 subscribers
  • 1.02K Posts
  • 5.82K Comments
  • Modlog
  • mods:
  • communick
  • BE: 0.19.8
  • Modlog
  • Instances
  • Docs
  • Code
  • join-lemmy.org