DeepSeekR1 celebrates its first anniversary with the launch of its new model, MODEL1, on January 21st. This release is accompanied by significant updates to the company's FlashMLA code repository on GitHub, highlighting the advancements in their technology. According to the results published in the material, these updates are expected to enhance the performance and capabilities of their products.
Overview of MODEL1 in FlashMLA Repository
The MODEL1 has been referenced 28 times across 114 files in the updated FlashMLA repository, indicating its distinct features compared to the previous version, V32, which is confirmed to be DeepSeekV32. This suggests that MODEL1 may represent a new architectural approach for the company.
Notable Code Enhancements
Notable differences in the code include:
- enhancements in KV cache layout
- improved sparsity handling
- advancements in FP8 decoding
Additionally, multiple memory optimization adjustments have been implemented, showcasing DeepSeekR1's commitment to refining its technology and improving performance.
KNOT Technologies recently raised $1 million in pre-seed funding to enhance its AI-driven ticketing solution, a significant development in the industry. This follows DeepSeekR1's anniversary celebration and new model launch. For more details, see read more.







