Please use this identifier to cite or link to this item:
http://dspace.azjhpc.org/xmlui/handle/123456789/194
Title: | How chatgpt Works: Understanding the Architecture and Training Process of chatgpt |
Authors: | Hajiyev, Aligulu |
Keywords: | chatgpt;openai;probability distribution;self-attention mechanisms |
Issue Date: | 11-May-2023 |
Publisher: | Azərbaycan Dövlət Neft və Sənaye Universiteti |
Abstract: | Chatgpt is an openai conversational AI system built on a transformer architecture with self-attention methods. Openai used a vast quantity of text data from the internet to train the model, and the machine learnt from the data via unsupervised learning. The model fine-tuned on a smaller dataset of talks following training to increase its ability to provide coherent and contextually relevant replies. During inference, the model analyses the input text and develops a probability distribution across all potential answers, after which the response with the highest probability chosen as the output. Overall, chatgpt is a remarkable achievement of natural language processing and deep learning, with several potential applications in customer assistance, education, content development, and targeted marketing. |
URI: | http://dspace.azjhpc.org/xmlui/handle/123456789/194 |
Journal Title: | 1st INTERNATIONAL CONFERENCE ON THE 4th INDUSTRIAL REVOLUTION AND INFORMATION TECHNOLOGY |
metadata.dc.source.booktitle: | 1st INTERNATIONAL CONFERENCE ON THE 4th INDUSTRIAL REVOLUTION AND INFORMATION TECHNOLOGY |
Volume: | 1 |
Issue: | 1 |
First page number: | 65 |
Last page number: | 68 |
Number of pages: | 4 |
Appears in Collections: | 1st INTERNATIONAL CONFERENCE ON THE 4th INDUSTRIAL REVOLUTION AND INFORMATION TECHNOLOGY |
Files in This Item:
File | Description | Size | Format | |
---|---|---|---|---|
Industy_4-65-68.pdf | 268.2 kB | Adobe PDF | View/Open |
Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.