Μετεωρολογικές βάσεις δεδομένων: εφαρμογές εξόρυξης πληροφορίας και επίδραση της διαμόρφωσης της εισόδου στην απόδοσή τους

doi:10.12681/eadd/26096

Home

Browse

Discipline

Date

Author

Country

Language

Degree Grantor

About

Theses Submission

FAQ

Helpdesk

Open Data

Abstract

Database management systems (DBMS) were developed to collect, store, organize and manage data. Data and information are retrieved from databases through known and clearly formulated questions (queries) and, additionally, through information discovery with the application of data mining techniques. Data mining algorithms operate on data and discover previously unknown information. In this thesis, a meteorological database is first designed and then target data is used in data mining applications and for conducting research work using a modified Knowledge Discovery from Databases (KDD) procedure. Data Mining applications concerning the operational data of the National Hail Suppression Program of the Hellenic Agricultural Insurance Organization are the Hail class estimation, Maximum hail size prediction, Prediction of hail suppression program seeding parameters, and Extraction of the observed convective day category index. The process of Knowledge Discovery from the meteorological database is used to conduct research work by appropriately modifying the CRISP-DM model. The goal is to build one or more data mining models in order to identify the occurrence of precipitation at a point on the ground, using data from a meteorological station of the National Meteorological Service and the whole ERA-40 dataset of the European Centre for Medium-range Weather Forecast (ECMWF). Different scenarios and strategies are formulated for the selection or transformation of the input to data mining techniques, which rely mainly on empirical knowledge of the field data and are used to consider issues that may affect the performance of five classification algorithms. More specifically, the effect the training dataset size has on the performance of the algorithms is studied and the optimal size that ensures the best performance of each algorithm is determined. Furthermore, the study of two different approaches for the formation of training datasets demonstrates that the performance of the algorithms is independent of the choice of the instances, i.e., when random instances or all the instances of randomly selected years are used. During the process of weather forecasting in a region, operational meteorologists usually examine the temporal changes of the meteorological parameters. Considering three different scenarios related to the transformation of the independent variables or input characteristics, the performance of the classification algorithms is better when normal parameter values rather than temporal changes are used. Note that these three scenarios are examined both for the natural distribution of data on the dependent variable and the balanced distribution using the random under resampling method. The distribution of the dependent precipitation class variable raises the class imbalance issue, the handling of which is attempted with the implementation of various methods. More specifically, nine techniques of the resampling method beyond the natural distribution are applied. They are drawn from the literature or are newly proposed based on meteorological expertise. Additionally, the boosting method AdaBoost M1 is applied to improve the performance of classification algorithms. The results show that the performance of only one algorithm is not affected by the application of these techniques when compared to the natural distribution. The performance of the remaining four algorithms improves significantly, particularly when the new proposed technique that is based on meteorological expertise is used.

	Read Online
	Download full text in PDF format (2.04 MB) (Available only to registered users) I declare that I have read and unconditionally agree and accept the Terms of Use of the National Archive of Ph.D. Theses, as well as the

All items in National Archive of Phd theses are protected by copyright.

DOI	10.12681/eadd/26096
Handle URL	http://hdl.handle.net/10442/hedi/26096
ND	26096
Alternative title	Μετεωρολογικές βάσεις δεδομένων: εφαρμογές εξόρυξης πληροφορίας και επίδραση της διαμόρφωσης της εισόδου στην απόδοσή τους
Author	Tsagalidis, Evangelos (Father's name: Georgios)
Date	2011
Degree Grantor	University of Macedonia Economic and Social Sciences
Committee members	Ευαγγελίδης Γεώργιος Σατρατζέμη Μαρία Δερβός Δημήτριος Παπαναστασίου Δημήτριος Μαργαρίτης Κωνσταντίνος Μελάς Δημήτριος Σαμαράς Νικόλαος
Discipline	Natural Sciences Computer and Information Sciences
Keywords	Meteorological databases; Data mining; Knowledge discovery in databases; Training dataset; Class imbalance; Precipitation prediction; Hail size prediction; Hail suppression program seeding parameters
Country	Greece
Language	Greek
Description	xii, 130 σ., tbls., fig., ch., ind.

Usage statistics

VIEWS

Concern the unique Ph.D. Thesis' views for the period 07/2018 - 07/2023.
Source: Google Analytics.

ONLINE READER

Concern the online reader's opening for the period 07/2018 - 07/2023.
Source: Google Analytics.

DOWNLOADS

Concern all downloads of this Ph.D. Thesis' digital file.
Source: National Archive of Ph.D. Theses.

USERS

Concern all registered users of National Archive of Ph.D. Theses who have interacted with this Ph.D. Thesis. Mostly, it concerns downloads.
Source: National Archive of Ph.D. Theses.

Related items (based on users' visits)

Ανάπτυξη παλιρροϊκού μοντέλου για τη Μεσόγειο Θάλασσα με αφομοίωση αλτιμετρικών δεδομένων και δεδομένων από παλιρροϊκούς σταθμούς σε υδροδυναμικά μοντέλα

Πρόβλεψη των χωροχρονικών μεταβολών της στάθμης υπογείων υδάτων με χρήση τεχνητών νευρωνικών δικτύων και γεωστατιστικών μεθόδων

Εκτίμηση και προληπτικός σχεδιασμός αντιμετώπισης της ξηρασίας

Σχεδιασμός και ανάπτυξη ενός συστήματος αναγνώρισης γεωτεμαχίων με βάση κτηματολογικές και γεωργικές καταγραφές

Στρατός και πολιτική εξουσία στη μετεμφυλιακή Ελλάδα (1949-1967)

Η εξέλιξη της βρετανικής αποτρεπτικής δύναμης στον ψυχρό πόλεμο: η ικανότητα δεύτερου πλήγματος εναντίον της Μόσχας και η μεταβίβαση του αποτρεπτικού μέσου από την Αεροπορία (Royal Air Force) στο Ναυτικό (Royal Navy): ανάλυση έξι αποφάσεων προμηθειών από το Μάρτιο του 1955 έως τον Ιανουάριο του 1968

Μερικές διαφορικές εξισώσεις και προβλήματα της επιστήμης των υλικών

Εφαρμογές της επιστήμης του χάους και της πολυπλοκότητας στη μελέτη γεωφυσικών και διαστημικών φαινομένων

Μελέτη της στοχαστικότητας τροχιών μη γραμμικών δυναμικών συστημάτων: συστήματα με επαναλαμβανόμενες σκεδάσεις

Εξυγίανση ρυπασμένου υπόγειου υδροφορέα από οργανικούς και ανόργανους ρύπους με εφαρμογή της τεχνολογίας των διαπερατών αντιδρώντων φραγμάτων

"Meteorological databases: data mining applications and the effect of configuration of input in their performance"
	Please, type what you see in the image!
I declare that I have read and unconditionally agree and accept the Terms of Use of the National Archive of Ph.D. Theses, as well as the.