OpenAI - ChatGPT

ChatGPT is, strictly speaking, a generative language model. In practice, however, it is best understood as an artificial intelligence chatbot trained and designed to hold natural conversations.
ChatGPT belongs to the research company OpenAI, founded in San Francisco in 2015 by Sam Altman, Elon Musk, Greg Brockman, Ilya Sutskever and Wojciech Zaremba.

ChatGPT was launched as a prototype on November 30, 2022, and quickly garnered attention for its detailed responses and articulate answers across many domains of knowledge. Its uneven factual accuracy was identified as a significant drawback. Following the release of ChatGPT, OpenAI was valued at $29 billion.
Elon Musk found that OpenAI was training ChatGPT using Twitter’s database and immediately cut off its access, arguing that since OpenAI is now for-profit and closed-source, it should pay for that data.

How does it work?

GPT-3 is a generative language model based on the transformer architecture, which is capable of processing large amounts of text and performing natural language processing tasks effectively.
The model is trained on a very large dataset of text, on the order of 8 million documents and 10 billion words, to learn how to generate coherent and well-written text.
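To make that concrete, the sketch below shows the standard next-token prediction (language modeling) objective such models are trained with. It is an illustration only: the tensor shapes and vocabulary size are made up, and this is not OpenAI's actual training code.

```python
# Minimal sketch of the next-token prediction objective behind GPT-style
# models; shapes and the vocabulary size are illustrative assumptions.
import torch
import torch.nn.functional as F

def language_modeling_loss(logits: torch.Tensor, tokens: torch.Tensor) -> torch.Tensor:
    """Cross-entropy between predicted next-token distributions and the actual next tokens.

    logits: (batch, seq_len, vocab_size) - model outputs at each position
    tokens: (batch, seq_len)             - the input token ids
    """
    # Predict token t+1 from positions up to t: shift logits and targets by one.
    shifted_logits = logits[:, :-1, :].reshape(-1, logits.size(-1))
    targets = tokens[:, 1:].reshape(-1)
    return F.cross_entropy(shifted_logits, targets)

# Toy usage with random data standing in for a real model's outputs.
batch, seq_len, vocab = 2, 16, 50257
logits = torch.randn(batch, seq_len, vocab)
tokens = torch.randint(0, vocab, (batch, seq_len))
print(language_modeling_loss(logits, tokens))
```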
To improve the model's performance, reinforcement learning is used with a reward model that is based on human feedback.
The reward model is created by collecting comparison data, which is generated by randomly selecting conversation transcripts and having human trainers rank the quality of different model responses.
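These rankings are typically turned into a pairwise loss: the reward model is pushed to score the response the human preferred above the one it was compared against. The sketch below illustrates that ranking loss (as described in the InstructGPT work); it is not OpenAI's code, and the example scores are invented.

```python
# Sketch of the pairwise ranking loss commonly used to train reward models
# from human comparisons; an illustration, not OpenAI's implementation.
import torch
import torch.nn.functional as F

def reward_ranking_loss(reward_preferred: torch.Tensor,
                        reward_rejected: torch.Tensor) -> torch.Tensor:
    """Push the reward of the human-preferred response above the other.

    Both inputs hold one scalar reward per comparison pair, shape (batch,).
    """
    # -log sigmoid(r_preferred - r_rejected): minimized when the preferred
    # response consistently receives the larger reward.
    return -F.logsigmoid(reward_preferred - reward_rejected).mean()

# Toy usage: scores a reward model might assign to two candidate responses.
r_good = torch.tensor([1.2, 0.4, 2.0])
r_bad = torch.tensor([0.3, 0.9, -0.5])
print(reward_ranking_loss(r_good, r_bad))
```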
The training is done on a Microsoft Azure platform using Proximal Policy Optimization.
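Proximal Policy Optimization (PPO) updates the model with a "clipped" objective that keeps each update close to the previous policy. Below is a minimal sketch of that clipped loss; the variable names and the toy numbers are simplified assumptions, not OpenAI's implementation.

```python
# Sketch of PPO's clipped surrogate objective (Schulman et al., 2017);
# a toy illustration of the update rule, not OpenAI's code.
import torch

def ppo_clip_loss(logp_new: torch.Tensor, logp_old: torch.Tensor,
                  advantages: torch.Tensor, clip_eps: float = 0.2) -> torch.Tensor:
    """Clipped policy-gradient loss.

    logp_new / logp_old: log-probabilities of the sampled tokens under the
    current and pre-update policies; advantages: how much better each sample
    was than expected (here they would come from the reward model).
    """
    ratio = torch.exp(logp_new - logp_old)
    clipped = torch.clamp(ratio, 1.0 - clip_eps, 1.0 + clip_eps)
    # Take the pessimistic (minimum) objective so updates stay near the old policy.
    return -torch.min(ratio * advantages, clipped * advantages).mean()

# Toy usage with made-up numbers.
lp_new = torch.tensor([-1.0, -0.5])
lp_old = torch.tensor([-1.1, -0.6])
adv = torch.tensor([0.8, -0.3])
print(ppo_clip_loss(lp_new, lp_old, adv))
```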
Once trained, GPT can be used to generate appropriate and coherent responses to a given input, such as a question or context sentence, for use in chatbots or other applications.
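Since ChatGPT itself is only reachable through OpenAI's API, the example below uses the openly available GPT-2 model from the Hugging Face transformers library as a stand-in, to show what "generating a response to a given input" looks like in practice.

```python
# Hedged example: GPT-2 serves here as an open stand-in for ChatGPT/GPT-3
# to demonstrate prompting a pre-trained generative transformer.
from transformers import GPT2LMHeadModel, GPT2Tokenizer

tokenizer = GPT2Tokenizer.from_pretrained("gpt2")
model = GPT2LMHeadModel.from_pretrained("gpt2")

prompt = "Q: What is a transformer in machine learning?\nA:"
inputs = tokenizer(prompt, return_tensors="pt")

# Sample a continuation; do_sample and top_p control the randomness.
output_ids = model.generate(
    **inputs,
    max_new_tokens=60,
    do_sample=True,
    top_p=0.9,
    pad_token_id=tokenizer.eos_token_id,
)
print(tokenizer.decode(output_ids[0], skip_special_tokens=True))
```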

WHAT DO YOU MEAN BY TRANSFORMER-BASED?

A "transformation" is a type of data processing that is performed on a sequence of elements, such as words in a sentence or characters in a word.
"Transformers" are machine learning models specifically designed to process sequences of elements using transformations.
The transformer architecture is based on the use of attention, a technique that allows the model to selectively focus on different parts of the input sequence while processing it.
The attention mechanism allows the transformer to process information more effectively and perform natural language processing tasks more accurately.
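A minimal sketch of the core operation, scaled dot-product attention, is shown below; the shapes and example data are illustrative only.

```python
# Minimal sketch of scaled dot-product attention, the mechanism described above.
import numpy as np

def attention(Q: np.ndarray, K: np.ndarray, V: np.ndarray) -> np.ndarray:
    """Each output position is a weighted mix of the values V, where the
    weights say how much that position should 'focus on' every other one."""
    d_k = Q.shape[-1]
    scores = Q @ K.T / np.sqrt(d_k)  # similarity of each query to each key
    weights = np.exp(scores - scores.max(axis=-1, keepdims=True))
    weights /= weights.sum(axis=-1, keepdims=True)  # softmax over the sequence
    return weights @ V

# Toy usage: a "sentence" of 4 tokens, each an 8-dimensional vector.
rng = np.random.default_rng(0)
x = rng.standard_normal((4, 8))
print(attention(x, x, x).shape)  # (4, 8): one updated vector per token
```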
GPT (Generative Pre-trained Transformer) is a generative language model based on the transformer architecture.
This architecture is very effective for performing natural language processing tasks and has revolutionized the way many NLP tasks are approached.
