On sampling strategies for small and continuous data with the modeling of genetic programming and adaptive neuro-fuzzy inference system

S. Sen, E. A. Sezer, C. Gokceoglu, S. Yagiz

Research output: Contribution to journal › Article

5 Citations (Scopus)

Abstract

Sampling strategies, which have a very significant role in examining data characteristics (e.g. imbalanced, small, exhaustive), have been discussed in the literature for the last couple of decades. In this study, the sampling problem encountered with small and continuous data sets is examined. Sampling with measured data by employing k-fold cross validation and sampling with synthetic data generated by fuzzy c-means clustering are applied, and the performances of genetic programming (GP) and the adaptive neuro-fuzzy inference system (ANFIS) on these data sets are then discussed. The experimental results indicate that fuzzy c-means based synthetic sampling is more successful than k-fold cross validation when modeling small and continuous data sets with ANFIS and GP, so it can be proposed for these types of data sets. Additionally, ANFIS shows slightly better performance than GP when synthetic data is employed, but GP is less sensitive to the data set and produces outputs in a narrower range than ANFIS's outputs when k-fold cross validation is employed.
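The two sampling ingredients named in the abstract, k-fold cross validation and fuzzy c-means clustering, can be sketched as follows. This is a minimal illustration under stated assumptions, not the authors' implementation: the function names, the fuzzifier m = 2, the fixed iteration count, and the toy data are all hypothetical, and the abstract does not say how synthetic samples are derived from the clusters, so the sketch stops at cluster centers and memberships.

```python
import random


def k_fold_indices(n_samples, k, seed=0):
    """Shuffle range(n_samples) and return k (train, test) index pairs.

    The test folds are disjoint and together cover every sample once.
    """
    idx = list(range(n_samples))
    random.Random(seed).shuffle(idx)
    folds = [idx[i::k] for i in range(k)]
    pairs = []
    for i in range(k):
        train = [j for f in folds[:i] + folds[i + 1:] for j in f]
        pairs.append((train, folds[i]))
    return pairs


def fuzzy_c_means(points, c, m=2.0, iters=100, seed=0):
    """Standard fuzzy c-means on a list of equal-length tuples.

    Returns (centers, u), where u[i][j] is the membership of point i
    in cluster j and each row of u sums to 1.
    The fuzzifier m and the iteration count are illustrative defaults.
    """
    rng = random.Random(seed)
    n, dim = len(points), len(points[0])
    # Random initial memberships, each row normalised to sum to 1.
    u = []
    for _ in range(n):
        row = [rng.random() + 1e-9 for _ in range(c)]
        total = sum(row)
        u.append([v / total for v in row])
    centers = []
    for _ in range(iters):
        # Center update: mean of the points weighted by u^m.
        centers = []
        for j in range(c):
            w = [u[i][j] ** m for i in range(n)]
            total = sum(w)
            centers.append(tuple(
                sum(w[i] * points[i][d] for i in range(n)) / total
                for d in range(dim)
            ))
        # Membership update: u_ij = 1 / sum_l (d_ij / d_il)^(2 / (m - 1)).
        for i in range(n):
            dist = [
                max(sum((points[i][d] - centers[j][d]) ** 2
                        for d in range(dim)) ** 0.5, 1e-12)
                for j in range(c)
            ]
            for j in range(c):
                u[i][j] = 1.0 / sum(
                    (dist[j] / dist[l]) ** (2.0 / (m - 1.0))
                    for l in range(c)
                )
    return centers, u


# Toy one-dimensional data with two well-separated groups (made up).
data = [(0.0,), (0.1,), (0.2,), (10.0,), (10.1,), (10.2,)]
splits = k_fold_indices(len(data), k=3)
centers, memberships = fuzzy_c_means(data, c=2)
```

With a small data set, each k-fold split leaves very few test samples, which is the situation the abstract contrasts with cluster-based synthetic sampling.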

Original language: English
Pages (from-to): 297-304
Number of pages: 8
Journal: Journal of Intelligent and Fuzzy Systems
Volume: 23
Issue number: 6
DOI: 10.3233/IFS-2012-0521
Publication status: Published - Dec 1 2012
Externally published: Yes

Keywords

  • adaptive neuro-fuzzy inference system
  • genetic programming
  • Sampling strategies
  • small and continuous data

ASJC Scopus subject areas

  • Statistics and Probability
  • Engineering (all)
  • Artificial Intelligence

Cite this

On sampling strategies for small and continuous data with the modeling of genetic programming and adaptive neuro-fuzzy inference system. / Sen, S.; Sezer, E. A.; Gokceoglu, C.; Yagiz, S.

In: Journal of Intelligent and Fuzzy Systems, Vol. 23, No. 6, 01.12.2012, p. 297-304.

@article{e62c6f2bb4664b3683fabade4a48188b,
title = "On sampling strategies for small and continuous data with the modeling of genetic programming and adaptive neuro-fuzzy inference system",
abstract = "Sampling strategies, which have a very significant role in examining data characteristics (e.g. imbalanced, small, exhaustive), have been discussed in the literature for the last couple of decades. In this study, the sampling problem encountered with small and continuous data sets is examined. Sampling with measured data by employing k-fold cross validation and sampling with synthetic data generated by fuzzy c-means clustering are applied, and the performances of genetic programming (GP) and the adaptive neuro-fuzzy inference system (ANFIS) on these data sets are then discussed. The experimental results indicate that fuzzy c-means based synthetic sampling is more successful than k-fold cross validation when modeling small and continuous data sets with ANFIS and GP, so it can be proposed for these types of data sets. Additionally, ANFIS shows slightly better performance than GP when synthetic data is employed, but GP is less sensitive to the data set and produces outputs in a narrower range than ANFIS's outputs when k-fold cross validation is employed.",
keywords = "adaptive neuro-fuzzy inference system, genetic programming, Sampling strategies, small and continuous data",
author = "S. Sen and Sezer, {E. A.} and C. Gokceoglu and S. Yagiz",
year = "2012",
month = "12",
day = "1",
doi = "10.3233/IFS-2012-0521",
language = "English",
volume = "23",
pages = "297--304",
journal = "Journal of Intelligent and Fuzzy Systems",
issn = "1064-1246",
publisher = "IOS Press",
number = "6",

}

TY - JOUR

T1 - On sampling strategies for small and continuous data with the modeling of genetic programming and adaptive neuro-fuzzy inference system

AU - Sen, S.

AU - Sezer, E. A.

AU - Gokceoglu, C.

AU - Yagiz, S.

PY - 2012/12/1

Y1 - 2012/12/1

AB - Sampling strategies, which have a very significant role in examining data characteristics (e.g. imbalanced, small, exhaustive), have been discussed in the literature for the last couple of decades. In this study, the sampling problem encountered with small and continuous data sets is examined. Sampling with measured data by employing k-fold cross validation and sampling with synthetic data generated by fuzzy c-means clustering are applied, and the performances of genetic programming (GP) and the adaptive neuro-fuzzy inference system (ANFIS) on these data sets are then discussed. The experimental results indicate that fuzzy c-means based synthetic sampling is more successful than k-fold cross validation when modeling small and continuous data sets with ANFIS and GP, so it can be proposed for these types of data sets. Additionally, ANFIS shows slightly better performance than GP when synthetic data is employed, but GP is less sensitive to the data set and produces outputs in a narrower range than ANFIS's outputs when k-fold cross validation is employed.

KW - adaptive neuro-fuzzy inference system

KW - genetic programming

KW - Sampling strategies

KW - small and continuous data

UR - http://www.scopus.com/inward/record.url?scp=84872313045&partnerID=8YFLogxK

UR - http://www.scopus.com/inward/citedby.url?scp=84872313045&partnerID=8YFLogxK

U2 - 10.3233/IFS-2012-0521

DO - 10.3233/IFS-2012-0521

M3 - Article

VL - 23

SP - 297

EP - 304

JO - Journal of Intelligent and Fuzzy Systems

JF - Journal of Intelligent and Fuzzy Systems

SN - 1064-1246

IS - 6

ER -