TY - GEN
T1 - Ranking importance based information on the world wide web
AU - Maan, Akshay Kumar
AU - James, Alex Pappachen
PY - 2012/9/17
Y1 - 2012/9/17
N2 - Identifying useful features for classification and forecast tasks from a ranked data is highly difficult and challenging. By ranking user popularity ratings from normalised area histograms, a method of feature selection for ranked data inspired from the law of vital few is proposed. We propose that the attributes that are most stable against the variations in classes have their usefulness in a forecasting task, while the attributes that are most unstable between inter-class samples but most stable within intra-class samples have their usefulness in classification tasks. The performance of the proposed method is demonstrated through a realistic example of web-content data from Yahoo! research repository: the user rating of web pages. The attributes in the data when ranked based on their importance in a year show distinct characteristics of performance in the tasks of popularity forecast and classification.
AB - Identifying useful features for classification and forecast tasks from a ranked data is highly difficult and challenging. By ranking user popularity ratings from normalised area histograms, a method of feature selection for ranked data inspired from the law of vital few is proposed. We propose that the attributes that are most stable against the variations in classes have their usefulness in a forecasting task, while the attributes that are most unstable between inter-class samples but most stable within intra-class samples have their usefulness in classification tasks. The performance of the proposed method is demonstrated through a realistic example of web-content data from Yahoo! research repository: the user rating of web pages. The attributes in the data when ranked based on their importance in a year show distinct characteristics of performance in the tasks of popularity forecast and classification.
KW - WWW
KW - feature selection
KW - ranked distribution
KW - user rating
UR - http://www.scopus.com/inward/record.url?scp=84866060820&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=84866060820&partnerID=8YFLogxK
U2 - 10.1145/2345396.2345540
DO - 10.1145/2345396.2345540
M3 - Conference contribution
AN - SCOPUS:84866060820
SN - 9781450311960
T3 - ACM International Conference Proceeding Series
SP - 889
EP - 897
BT - Proceedings of the 2012 International Conference on Advances in Computing, Communications and Informatics, ICACCI'12
T2 - 2012 International Conference on Advances in Computing, Communications and Informatics, ICACCI 2012
Y2 - 3 August 2012 through 5 August 2012
ER -