https://huggingface.co/deepseek-ai/deepseek-llm-67b-chat
https://huggingface.co/deepseek-ai/deepseek-llm-67b-base
Knowledge cutoff May 2023, not bad.
Online demo: https://chat.deepseek.com/ (Google oauth login)
Another Chinese model; the demo is censored by keyword filters, but it's not that censored when run locally.
I asked it to create a simple chat interface to talk with OpenAI's GPT-3.5 API and to use the `stream: true` option. On the first try, it didn't know how to handle the stream, so it simply used `res.json()`. After that, I told it that we needed to handle the streaming text in a special way. It understood this and wrote the correct code. Overall, I'm quite impressed. Way to go, DeepSeek Coder!
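For context on what "handling the stream in a special way" means: with `stream: true`, the API sends Server-Sent Events, `data: {...}` lines carrying token deltas, rather than one JSON body, so `res.json()` doesn't apply. A minimal sketch of parsing one such chunk (assuming the documented `choices[0].delta.content` shape and the `data: [DONE]` sentinel):

```javascript
// Sketch: extract the token fragments from one SSE chunk of an
// OpenAI-style streaming response. Assumes "data: {...}" lines
// terminated by a "data: [DONE]" sentinel.
function extractDeltas(chunkText) {
  const deltas = [];
  for (const line of chunkText.split("\n")) {
    const trimmed = line.trim();
    if (!trimmed.startsWith("data:")) continue; // skip blank lines
    const payload = trimmed.slice(5).trim();
    if (payload === "[DONE]") break;            // end-of-stream marker
    const parsed = JSON.parse(payload);
    const content = parsed.choices?.[0]?.delta?.content;
    if (content) deltas.push(content);          // partial token text
  }
  return deltas.join("");
}
```

In a real chat UI you would call something like this on every chunk and append the result to the visible message as it arrives.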
- Grant of Copyright License. Subject to the terms and conditions of this License, DeepSeek hereby grants to You a perpetual, worldwide, non-exclusive, no-charge, royalty-free, irrevocable copyright license to reproduce, prepare, publicly display, publicly perform, sublicense, and distribute the Complementary Material, the Model, and Derivatives of the Model.
I really really enjoy seeing perpetual irrevocable licenses.
It’s nice to see this when every other ToS we click through says the reverse…
“By using this service, you grant Meta/Google/Microsoft a perpetual, royalty free right to reprint, reproduce and use your content”.
If someone converts this to GGUF files and uploads them before TheBloke does, please post here, thanks. (Looking for Q8.)
It's already there: https://huggingface.co/TheBloke/deepseek-llm-67b-chat-GGUF/tree/main
Their code models are extremely good, so I have high hopes for these.
Does it give refusals on base? 67B sounds like a full foundation train.
GGUF via TheBloke:
not that censored on local.
So… Some censoring?
I'm desensitized at this point. I wonder if this is yet another "Pretraining on the Test Set Is All You Need" marketing stunt or not, as most new models lately have been.
I threw my reasoning test questions at the web version and it performed worse than most 70Bs I've tried. About the level of Yi.
Wow, this model seems very good for the Italian language!
I wish there were a 13B model that could just fit on my GPU with quantization.
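Whether a quant fits is roughly predictable: weight memory is about parameter count times bits per weight, divided by 8 bytes. A back-of-envelope sketch (this deliberately ignores KV cache and activation overhead, so real usage is somewhat higher):

```javascript
// Rough estimate of quantized model weight size in GB:
// params * bits-per-weight / 8 bits-per-byte.
// Ignores KV cache and runtime overhead (an assumption, not exact).
function approxModelGB(paramsBillion, bitsPerWeight) {
  return (paramsBillion * 1e9 * bitsPerWeight) / 8 / 1e9;
}

approxModelGB(13, 4); // ~6.5 GB: a 4-bit 13B quant roughly fits an 8 GB card
approxModelGB(67, 8); // ~67 GB: why people want smaller sizes of this model
```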
The chat model is the first that knows how to compare the weight of bricks and feathers.
The weight of an object is determined by its mass and the gravitational force acting on it. In this case, both objects are being compared under the same gravitational conditions (assuming they’re both on Earth), so we can compare their masses directly to determine which weighs more.
1kg of bricks has a mass of 1 kilogram. 2kg of feathers has a mass of 2 kilograms.
Since 2 is greater than 1, the 2kg of feathers weigh more than the 1kg of bricks.
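The model's reasoning above reduces to a one-line comparison: with the same gravitational acceleration on both sides, weight is proportional to mass, so comparing masses suffices.

```javascript
// The comparison above in code: weight = mass * g, same g for both,
// so the mass comparison decides it.
const g = 9.81;                    // m/s^2, Earth surface gravity (assumed)
const bricksWeight = 1 * g;        // 1 kg of bricks
const feathersWeight = 2 * g;      // 2 kg of feathers
console.log(feathersWeight > bricksWeight); // true
```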
The coding is pretty damn good based on my limited tests. I'll have to experiment more.
I made it write about itself using LocalAI https://sfxworks.net/posts/deepseek/
I will post a how-to on using LocalAI in my free time if anyone is interested.
DeepSeek is one of my favorites; I use it every day for code generation. It's got an extra option for the chat now at the link you shared, just general chat about anything, and it's pretty good at it.
Seems I am doing something wrong with this one.
I got abysmal results with Q4_K_M: it made silly grammatical errors and typos, and it also didn't stick to the prompt, so I don't know.
I don't know if this helps, but I'm using the GGUF version of that and it's working perfectly.