Professional Certificate in AI in Chemistry · Guide

Fundamentals of Machine Learning

5 min read Updated 4 May 2026

Machine Learning (ML) is a fundamental aspect of Artificial Intelligence (AI) that allows computers to learn and improve their performance on a specific task without explicit programming. In this explanation, we will cover key terms and vocabulary related to the Fundamentals of Machine Learning in the course Professional Certificate in AI in Chemistry.

1. Supervised Learning: This is a type of machine learning where the model is trained on a labeled dataset. In other words, the input data and the corresponding output data are provided to the model during training. The model learns the relationship between the input and output data and uses it to make predictions on new, unseen data. Examples of supervised learning algorithms include linear regression, logistic regression, and support vector machines. 2. Unsupervised Learning: Unlike supervised learning, unsupervised learning involves training the model on an unlabeled dataset. The model is left to find patterns and relationships in the data without any prior knowledge of the output. Examples of unsupervised learning algorithms include clustering algorithms such as k-means and hierarchical clustering, and dimensionality reduction algorithms such as principal component analysis (PCA). 3. Semi-supervised Learning: This is a combination of supervised and unsupervised learning, where the model is trained on a dataset that is partially labeled. This approach is useful when obtaining labeled data is costly or time-consuming. 4. Regression: Regression is a type of supervised learning algorithm used for predicting a continuous output variable. Linear regression is a simple regression algorithm that models the relationship between the input and output variables as a linear function. 5. Classification: Classification is another type of supervised learning algorithm used for predicting a categorical output variable. Logistic regression is a popular classification algorithm that models the probability of the output variable belonging to a certain class. 6. Overfitting: Overfitting occurs when a machine learning model is too complex and learns the noise in the training data instead of the underlying pattern. This results in the model performing well on the training data but poorly on new, unseen data. Regularization techniques such as L1 and L2 regularization can help prevent overfitting. 7. Underfitting: Underfitting occurs when a machine learning model is too simple and fails to learn the underlying pattern in the training data. This results in the model performing poorly on both the training and new, unseen data. 8. Bias-Variance Tradeoff: The bias-variance tradeoff is a fundamental concept in machine learning that refers to the balance between the complexity of the model and its ability to generalize to new data. A high bias model has low variance and is prone to underfitting, while a high variance model has low bias and is prone to overfitting. 9. Cross-validation: Cross-validation is a technique used to evaluate the performance of a machine learning model. The data is split into k folds, and the model is trained on k-1 folds while the remaining fold is used for testing. This process is repeated k times, with a different fold used for testing each time. The average performance across all k trials is used as the final performance metric. 10. Hyperparameter Tuning: Hyperparameters are parameters that are set before training the model, such as the learning rate or the number of hidden layers in a neural network. Hyperparameter tuning involves selecting the best set of hyperparameters for the model to improve its performance. 11. Gradient Descent: Gradient descent is an optimization algorithm used to find the minimum of a function. It works by iteratively moving in the direction of the negative gradient of the function until it reaches a minimum. 12. Activation Function: An activation function is a function used in neural networks to introduce non-linearity into the model. Common activation functions include the sigmoid, tanh, and ReLU functions. 13. Backpropagation: Backpropagation is an algorithm used to train neural networks. It works by computing the gradient of the loss function with respect to each weight in the network and adjusting the weights accordingly. 14. Deep Learning: Deep learning is a subfield of machine learning that focuses on neural networks with many layers. These models are capable of learning complex patterns and representations from large datasets. 15. Transfer Learning: Transfer learning is a technique used in deep learning where a pre-trained model is fine-tuned on a new dataset. This saves time and resources as the model does not need to be trained from scratch. 16. Natural Language Processing (NLP): NLP is a field of AI that focuses on the interaction between computers and human language. It involves tasks such as language translation, sentiment analysis, and text summarization. 17. Computer Vision: Computer vision is a field of AI that focuses on enabling computers to interpret and understand visual information from the world. It involves tasks such as image recognition, object detection, and image segmentation.

In the context of the Professional Certificate in AI in Chemistry, machine learning can be used for various applications such as predicting chemical properties, drug discovery, and material science. For example, a machine learning model can be trained on a dataset of known chemical compounds and their corresponding properties. The model can then be used to predict the properties of new, unseen compounds, which can save time and resources in laboratory experiments.

However, there are challenges in applying machine learning to chemistry, such as the high dimensionality and complexity of chemical data. Feature selection and dimensionality reduction techniques can help address these challenges by identifying the most relevant features and reducing the number of features in the dataset.

In conclusion, machine learning is a powerful tool in AI that enables computers to learn and improve their performance on a specific task. Understanding the key terms and vocabulary related to the Fundamentals of Machine Learning is essential for success in the course Professional Certificate in AI in Chemistry. With the right knowledge and skills, machine learning can be applied to various applications in chemistry, such as predicting chemical properties and drug discovery.

Key takeaways

Machine Learning (ML) is a fundamental aspect of Artificial Intelligence (AI) that allows computers to learn and improve their performance on a specific task without explicit programming.
Examples of unsupervised learning algorithms include clustering algorithms such as k-means and hierarchical clustering, and dimensionality reduction algorithms such as principal component analysis (PCA).
In the context of the Professional Certificate in AI in Chemistry, machine learning can be used for various applications such as predicting chemical properties, drug discovery, and material science.
Feature selection and dimensionality reduction techniques can help address these challenges by identifying the most relevant features and reducing the number of features in the dataset.
Understanding the key terms and vocabulary related to the Fundamentals of Machine Learning is essential for success in the course Professional Certificate in AI in Chemistry.

Fundamentals of Machine Learning

Key takeaways

More from Professional Certificate in AI in Chemistry