• 0 Posts
  • 4 Comments
Joined 2 years ago
cake
Cake day: June 3rd, 2023

help-circle


  • As I said, the architectural changes are quite cool

    As far as I’ve understood it mostly comes down to splitting it up into multiple expert systems, so you don’t need to activate the complete system with every request

    But I’ve only scratched the surface…

    Also, open source… The weights are made publicly available.
    None of the training data or systems

    Edit: regarding “open source”:
    Also Meta’s Llama is on huggingface, just like deepseek. I still wouldn’t talk about transparency here