January 2025
Global Artificial Intelligence: Thoughts on DeepSeek and potential implications
DeepSeek is an artificial intelligence (AI) model that has recently garnered significant attention due to its efficiency and cost-effectiveness. The China-based AI company built the open-sourced model using a mixture-of-experts (MoE) training on other leading models and an optimized architecture that only uses a subset of the model’s parameters for each input. This reduces computational costs while performing on par with other leading large language models (LLMs) for simple use cases