AI boom

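The AI boom, also called the AI spring,^[1] is a period of rapid progress in the field of artificial intelligence. The boom is traced to work at OpenAI around 2016 or 2017, and generative AI is a major part of it.^[2] OpenAI's generative AI systems, such as the GPT models (first released in 2018) and DALL-E (released in 2021), helped drive this development.^[3]^[4]^[5]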

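As large language models improved in 2022, chatbots built on them became practical. At the same time, text-to-image models began producing images that can pass for human-made artwork,^[6] and speech synthesis became able to imitate human speech convincingly.^[7]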

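From late 2022 through 2023, dozens of new AI websites and AI chatbots launched as the large technology companies secured their positions in the market, and adoption of AI tools rose to unprecedented levels.^[8] Public reaction to the boom has been mixed. Some argue that AI can help develop human potential and benefit humanity; others argue that it will cause large-scale job losses and point to the many shortcomings of current AI technology.^[9]^[10]^[11]^[12]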

Language models

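GPT-3, a large language model released by OpenAI in 2020, can generate high-quality text that is hard to distinguish from human writing.^[13] OpenAI later released GPT-3.5, which was used in ChatGPT and drew wide attention for giving clear answers to questions across many domains.^[14] OpenAI then released GPT-4 in March 2023; the model is also used in the Microsoft Bing search engine.^[15]^[16] Other companies have released their own language models, including Google's LaMDA and Meta's LLaMA.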

Text-to-image models

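DALL-E, released by OpenAI in January 2021, was among the first text-to-image models to attract wide attention.^[17] DALL-E 2, which generates more realistic images, followed in April 2022,^[18] and Stable Diffusion, an open-source alternative, was released in August 2022.^[19]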

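Text-to-image models were followed by several text-to-video models driven by language models, including DAMO,^[20] Make-A-Video,^[21] Imagen Video,^[22] and Phenaki,^[23] which generate video automatically from text prompts or from combined text and image prompts.^[24]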

Speech synthesis

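15.ai, released in March 2020, was among the first publicly available speech synthesis tools; it lets people produce an imitation of a voice from audio input.^[25]^[26] ElevenLabs later launched a website that lets members of the public upload voice samples for speech synthesis. The company was widely criticized after users imitated the voices of well-known people and used them to publish fake statements.^[27] The technology has also raised public concern about its use in deepfakes.^[28] After someone used speech synthesis to create music in the voices of Drake and The Weeknd, many people questioned the legality and ethics of the technology.^[29]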

References

[1] Bommasani, Rishi. AI Spring? Four Takeaways from Major Releases in Foundation Models. Stanford Institute for Human-Centered Artificial Intelligence. 2023-03-17. Retrieved 2023-05-16. Archived from the original on 2023-05-07.
[2] Why am I not terrified of AI? Shtetl-Optimized. 2023-03-06. Retrieved 2023-03-19. Archived from the original on 2023-05-12.
[3] Newman, Daniel. Exploring The Ins And Outs Of The Generative AI Boom. Forbes. Retrieved 2023-03-14. Archived from the original on 2023-03-28.
[4] The AI boom: lessons from history. The Economist. 2023-03-13. Retrieved 2023-03-15. Archived from the original on 2023-03-13.
[5] Kafka, Peter. The AI boom is here, and so are the lawsuits. Vox. 2023-02-01. Retrieved 2023-03-15. Archived from the original on 2023-05-09.
[6] Vincent, James. All these images were generated by Google's latest text-to-image AI. The Verge. 2022-05-24. Retrieved 2023-03-15. Archived from the original on 2023-02-15.
[7] AI-Generated Voice Firm Clamps Down After 4chan Makes Celebrity Voices for Abuse. www.vice.com. Retrieved 2023-03-15. Archived from the original on 2023-05-07.
[8] Firth-Butterfield, Kay. 2022 was a big year for AI development. In 2023, we must decide how best to use it. Asia News Network. 2023-01-18. Retrieved 2023-05-16. Archived from the original on 2023-03-19.
[9] No matter how sophisticated, artificial intelligence systems still need human oversight. ZDNET. Retrieved 2023-05-16. Archived from the original on 2023-05-10.
[10] Sukhadeve, Ashish. Council Post: Artificial Intelligence For Good: How AI Is Helping Humanity. Forbes. Retrieved 2023-05-16. Archived from the original on 2023-05-09.
[11] Could AI advancements be a threat to your job security? Learning People. www.learningpeople.com. Retrieved 2023-05-16. Archived from the original on 2023-05-09.
[12] Mok, Aaron; Zinkula, Jacob. ChatGPT may be coming for our jobs. Here are the 10 roles that AI is most likely to replace. Business Insider. Retrieved 2023-05-16. Archived from the original on 2023-05-09.
[13] Sagar, Ram. OpenAI Releases GPT-3, The Largest Model So Far. Analytics India Magazine. 2020-06-03. Retrieved 2023-03-15. Archived from the original on 2020-08-04.
[14] Lock, Samantha. What is AI chatbot phenomenon ChatGPT and could it replace humans? The Guardian. 2022-12-05. Retrieved 2023-03-15. ISSN 0261-3077. Archived from the original on 2023-01-16.
[15] Lardinois, Frederic. Microsoft's new Bing was using GPT-4 all along. TechCrunch. 2023-03-14. Retrieved 2023-03-15. Archived from the original on 2023-03-15.
[16] OpenAI announces ChatGPT successor GPT-4. BBC News. 2023-03-14. Retrieved 2023-03-15. Archived from the original on 2023-05-15.
[17] Coldewey, Devin. OpenAI's DALL-E creates plausible images of literally anything you ask it to. TechCrunch. 2021-01-05. Retrieved 2023-03-15. Archived from the original on 2021-01-06.
[18] Coldewey, Devin. New OpenAI tool draws anything, bigger and better than ever. TechCrunch. 2022-04-06. Retrieved 2023-03-15. Archived from the original on 2023-05-06.
[19] Stable Diffusion Public Release. Stability AI. Retrieved 2023-03-15. Archived from the original on 2022-08-30.
[20] ModelScope 魔搭社区. modelscope.cn. Retrieved 2023-03-20. Archived from the original on 2023-05-09.
[21] Kumar, Ashish. Meta AI Introduces 'Make-A-Video': An Artificial Intelligence System That Generates Videos From Text. MarkTechPost. 2022-10-03. Retrieved 2023-03-15. Archived from the original on 2022-12-01.
[22] Edwards, Benj. Google's newest AI generator creates HD video from text prompts. Ars Technica. 2022-10-05. Retrieved 2022-10-25. Archived from the original on 2023-02-07.
[23] Phenaki. phenaki.video. Retrieved 2022-10-03. Archived from the original on 2022-10-07.
[24] Edwards, Benj. Runway teases AI-powered text-to-video editing using written prompts. Ars Technica. 2022-09-09. Retrieved 2022-09-12. Archived from the original on 2023-01-27.
[25] Zwiezen, Zack. Website Lets You Make GLaDOS Say Whatever You Want. Kotaku. 2021-01-18. Retrieved 2021-01-18. Archived from the original on 2021-01-17.
[26] Ruppert, Liana. Make Portal's GLaDOS And Other Beloved Characters Say The Weirdest Things With This App. Game Informer. 2021-01-18. Retrieved 2021-01-18. Archived from the original on 2021-01-18.
[27] Jimenez, Jorge. AI company promises changes after 'voice cloning' tool used to make celebrities say awful things. PC Gamer. 2023-01-31. Retrieved 2023-02-03. Archived from the original on 2023-04-04.
[28] Seeing is believing? Global scramble to tackle deepfakes. Yahoo News. Retrieved 2023-05-16. Archived from the original on 2023-02-03.
[29] Coscarelli, Joe. An A.I. Hit of Fake ‘Drake’ and ‘The Weeknd’ Rattles the Music World. The New York Times. 2023-04-19. Retrieved 2023-05-16. ISSN 0362-4331. Archived from the original on 2023-05-15.


