Some models, such as Meta's Code Llama-Instruct, are trained with Supervised Learning followed by RLHF.

Source: 大观园的贾探春, 2025-02-05 17:11:26 (0 bytes)
