Till innehåll på sidan

Sheila Farrahi: Feature Construction of Multi-sets for Machine Learning Application

Master Thesis

Tid: On 2024-02-07 kl 09.50 - 10.30

Plats: Cramerrummet (Albano, SU)

Respondent: Sheila Farrahi

Handledare: Chun-Biu Li

Exportera till kalender

Abstract.

Many machine learning algorithms require inputs in the form of fixed length vectors and cannot directly process data in the form of multi-sets. In reality, raw data does not always exist in the form of fixed length vectors and can exist in the form of unordered multi-sets of scalar values, where the number of elements in each multi-set can often vary across multiple data instances. Feature construction is a technique to find a better data representation for the machine learning algorithms in case the original representation of data is not in the form of fixed length vectors.

Statistical measures and density estimation provide insights into the data, therefore they can be used as tools in constructing fixed length vectors of features from unordered multi-sets. In this report, several methods based on statistical measures and density estimation are explored for feature constructions from unordered multi-sets.