From the course: Machine Learning and AI in Cybersecurity by Pearson

Unlock this course with a free trial

Join today to access over 25,300 courses taught by industry experts.

What are large language models?

What are large language models?

So, let's begin with what are large language models? Well, you probably already know that they're a type of artificial intelligence, and the goal is to both understand and to generate human-like text. This is done based on the data the MAP model has been trained on. These models are built using deep learning techniques, specifically neural networks, This enables them to process and learn from vast amounts of text data. The reason we call them large language models is because they're characterized by their large number of parameters. These parameters are also used as the weights in the neural network and get adjusted during training. Even GPT-3 had hundreds of billions of parameters. The more parameters a model has, the more complex the patterns it can learn from the data. Now, these models get trained on diverse and extensive data sets that includes books, articles, websites, and pretty much any text source they can find. This extensive training allows the model to generate coherent…

Contents