Machine Learning based analysis of users’ online behaviour
Events and activities of daily life are increasingly often taking place in the online space, including, for example, the purchase of durable goods and education. Both of these areas, shopping and learning, which until a few years ago existed almost exclusively in the traditional offline format, have changed significantly. This change poses new challenges for professionals working in these fields, as most of the methods and methodologies used to date have become completely obsolete and unworkable in the online space. This is particularly true of the expertise of offline shop assistants or the role of teachers in brick-and-mortar educational facilities, roles which were
once indispensable, but have now become outdated. The disappearance of these roles has not gone unnoticed, given that many online businesses are struggling with dwindling customer numbers and decreasing effectiveness of online learning systems (such as Massive Open Online Courses – MOOCs) with effectiveness at barely 25-30%. While it is undeniable that the online presence has created considerable challenges for business and education managers, it has also opened up new opportunities that can be exploited, notably by involving data science professionals.
The topic of this idissertation is the development of different Machine Learning methods for webshop and MOOC applications based on log data analysis. What all applications have in common is the creation of aggregated databases, socalled user profiles, using log data of different widths and depths, which are used for classification, regression or even clustering. For more than fifteen years now there has been active research on the analysis of user log data. Initially, research and development were carried out in isolation on small databases in research teams or on closed internal databases in companies. In recent years, as online business and online educational interfaces have become more common, the number of real business applications and the amount and depth of data generated by each application have increased. Therefore, the previously traditional feature extraction and Machine Learning methods have been replaced by Deep Learning methods, which can provide high-quality solutions for large amounts of data, even starting from low-level data.
https://doktori.bibl.u-szeged.hu/id/eprint/11194/
https://doktori.bibl.u-szeged.hu/id/eprint/11194/3/Korosi_Gabor_Thesis.pdf