********************************************************************


                                                     Seminar

             Department of Systems Engineering and Engineering Management
                                  The Chinese University of Hong Kong

------------------------------------------------------------------------------------------

 

 

 

Title

:

Mining Patterns and Building Classifiers from Software Data: Addressing Software Maintenance and Reliability Issues

 

 

 

Speaker

:

Prof. David Lo

 

 

School of Information Systems

 

 

Singapore Management University

 

 

 

Date

:

June 19th, 2009 (Friday)

 

 

 

Time

:

4:30 p.m. - 5:30 p.m.

 

 

 

Venue

:

Room 513

 

 

William M.W. Mong Engineering Building

 

 

(Engineering Building Complex Phase 2)

 

 

CUHK

 

 

 

------------------------------------------------------------------------------------------

Abstract:
 

Software is a ubiquitous component of our daily life. There are many issues related to software development; these include: reducing cost involved in maintaining a software systems, and ensuring reliability of systems. Can data mining help?

Studies have shown that program comprehension takes up to 45% of software costs. One contributing factor is the lack of documented specification. In the first part of the talk, a technique to efficiently mine common software temporal patterns serving as candidate specifications would be described. This work extends latest study in sequential pattern mining and episode mining by mining a compact representative set of patterns that repeat frequently within a sequence and across many sequences. These patterns in turn could be post-processed to form rules and UML sequence diagrams and fed to downstream program analysis tools.

We often depend on the correct workings of software systems. Due to the difficulty and complexity of software systems, bugs and anomalies are prevalent. In the second part of the talk, we discuss a technique to classify software behaviors based on past history or runs. With the technique, it is possible to generalize past known erroneous behaviors to capture other failures. This work proposes a new pattern-based classification approach working on a set of sequences and applies it for software failure detection.


-------------------------------------------------------------------------------------------

Biography:
 

David Lo is an Assistant Professor in the School of Information Systems, Singapore Management University. He received his Ph.D. in Computer Science from the National University of Singapore in 2008. Before that he receives his B.Eng. in Computer Engineering from Nanyang Technological University in 2004. His research interests are frequent pattern mining, classification, reverse engineering, software maintenance, and software reliability. His work has been published in various venues in both data mining and software engineering area including: ICDE, KDD, SDM, FSE, ASE, JSME, etc.


************************* ALL ARE WELCOME ************************

 

 

 

Host

:

Prof. Cheng, Hong

Tel

:

(852) 2609-8300

Email

:

hcheng@se.cuhk.edu.hk

 

 

 

Enquiries

:

Prof. Nan Chen or Prof. Sean X. Zhou

 

:

Department of Systems Engineering and Engineering Management

 

 

CUHK

Website

:

http://www.se.cuhk.edu.hk/~seg5810

Email

:

seg5810@se.cuhk.edu.hk

 

 

 

********************************************************************