• No-Cod6872@alien.topB
    1 year ago

    Definitely interested. I loved C#/.NET but haven’t used it much in recent years. Have you used Python for inference? I’m wondering how it compares to C# for LLM-related programming.

    • Key-Comparison3261@alien.topOPB
      1 year ago

      In Python you have ExLlama, vLLM, and LMDeploy, and in most cases FastAPI is used to serve an HTTP endpoint.

      I wrote llm-sharp mainly to drop Python (the GIL, pip dependencies) and to adapt flexibly to dynamic model structures beyond standard Llama.
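
For readers unfamiliar with the Python serving pattern mentioned above, here is a minimal sketch of an HTTP generation endpoint. It is not code from the thread or from llm-sharp. To stay dependency-free it swaps FastAPI for the standard library's `http.server`, and `generate`, `handle_generate`, the `/generate` path, and the JSON field names are all hypothetical stand-ins for a real engine call (ExLlama, vLLM, LMDeploy) and a real API.

```python
import json
from http.server import BaseHTTPRequestHandler, HTTPServer


def generate(prompt: str, max_tokens: int = 16) -> str:
    """Hypothetical stand-in for an inference engine: echoes the first words."""
    return " ".join(prompt.split()[:max_tokens])


def handle_generate(body: bytes) -> tuple[int, dict]:
    """Validate a JSON request body and return (HTTP status, response payload)."""
    try:
        payload = json.loads(body)
    except ValueError:  # covers malformed JSON and undecodable bytes
        return 400, {"error": "invalid JSON"}
    if not isinstance(payload, dict) or not isinstance(payload.get("prompt"), str):
        return 400, {"error": "missing 'prompt'"}
    max_tokens = payload.get("max_tokens", 16)
    if not isinstance(max_tokens, int) or max_tokens < 0:
        return 400, {"error": "invalid 'max_tokens'"}
    return 200, {"text": generate(payload["prompt"], max_tokens)}


class Handler(BaseHTTPRequestHandler):
    def do_POST(self):
        # Assumed route; a FastAPI app would declare this with a decorator.
        if self.path != "/generate":
            self.send_error(404)
            return
        length = int(self.headers.get("Content-Length", 0))
        status, payload = handle_generate(self.rfile.read(length))
        data = json.dumps(payload).encode("utf-8")
        self.send_response(status)
        self.send_header("Content-Type", "application/json")
        self.send_header("Content-Length", str(len(data)))
        self.end_headers()
        self.wfile.write(data)


def serve(port: int = 8000) -> None:
    """Start the blocking server loop on localhost."""
    HTTPServer(("127.0.0.1", port), Handler).serve_forever()
```

Calling `serve()` starts the server, and a POST to `/generate` with a body like `{"prompt": "hello world", "max_tokens": 1}` returns the stub's output as JSON. A production setup would normally use an async framework and a batching engine instead of this single-threaded loop.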