Skip to content | Change text size
   

Research Strengths

Listed below are the research strengths of the School of Information Technology:
1. Research in Intelligent Systems
[+/-]Description

Research in intelligent systems improves the intelligence of computational systems and processes, including both artificial and natural computation.

Strengths include: Bayesian Reasoning, Complex Systems, Computer Vision, Intelligent Adaptive Diagrams and Documents, Constraint Programming, Data Mining, Diagram Interpretation and Understanding, Machine Learning, Natural Computation, Natural Language Processing, Neural Networks, Optimisation, Reasoning Under Uncertainty, Robotics and User Modelling.


[+/-]Publications

Tan C.L., Egerton S. & Ganapathy V. (2007), "Real-Time SLAM using Binocular Vision for Autonomous Mobile Robots", Conference on Innovative Technologies in Intelligent Systems & Industrial Applications (CITISIA 2007), 17-19 November 2007, Selangor, Malaysia.

Teo Y.J. & Loke K.S. (2007), "Intelligent Load Balancing Algorithm for a Mobile Agent System", Conference on IT Research and Applications (CITRA 2007), 4-5 April, Selangor, Malaysia.

Alhashmi S.M. and Siddiqi J. (2007), "Improving the Decision-Making Facet within Healthcare Logistics using Multiagents", MMU International Symposium on Information and Communications Technologies 2007 (M2USIC 2007), 19-20 November, Selangor, Malaysia.         

Loke, K.S. & Tan, C.L. (2006), “Diagram Matching Based on the Dual Ant Colony Optimization Algorithm”, The 2nd International Conference on Natural Computation and the 3re International Conference on Fuzzy Systems and Knowledge Discovery (ICNC-FSKD 2006), 24-28 Sept, Xi’an, China.

Koh, S.Y., Leow, S.K. & Loke, K.S., (2005), “Artificial Intelligence Applications and Innovations II”, 2ndInternational Federation for Information Processing Conference on Artificial Intelligence Applications and Innovations (AIAI2005), 7-9 September, Beijing, China.

Pang, L.S. & Khong, K.W., (2005), “An Interaction Framework for Knowledge Management”, 7th International Research Conference on Quality, Innovation and Knowledge Management, 16-18 February, Kuala Lumpur, Malaysia.

Lim, T.M., (2005), “Heuristic-Based Self-Learning Framework Management System”, National Conference on Computer Science, Technology and Networking 2005, 6-7 December, Shah Alam, Malaysia.

Lina, Lim, T.M. & Leow S.K., (2005), “A Hybrid Symbolic-Connectionist Approach for Real-Valued Pattern Classification”, 2ndInternational Federation for Information Processing Conference on Artificial Intelligence Applications and Innovations (AIAI2005), 7-9 September, Beijing, China.

Loke, K.S., (2005), “Notes on Defining Novelty for Computational Creativity”, 19th International Joint Conference on Artificial Intelligence, 30 Jul – 5 Aug, Edinburgh, Scotland.

Phung, Y.C., (2005), “Text Mining for Stock Movement Predictions: A Malaysian Perspective”, 6th International Conference on Data Mining, 25-27 May, Skiathos, Greece.

Leow, S.K. and Koh, S.Y., (2004), “Using the Invariant Optimal Assignment of a k-out-of-n: G System to Test the Effectiveness of Genetic Algorithms”, 8th IEEE International Conference on Intelligent Engineering Systems 2004 (INES 2004), 19-21 September, Romania.

Munusamy, M. and Leow, S.K (2004), “A Conceptual Framework for the Integration of Clustering and GA for a Web Personalization System”, IEEE 4th International Conference on Intelligent Systems Design and Application (ISDA 2004), 26-28 August, Budapest, Hungary.


[+/-]Staff

[+/-]Higher Degree by Research (HDR) students

Doctor of Philosophy (PhD)

1) Name: Hong Jer Lang

Email: jlhon2@student.monash.edu

Supervisors:
Dr Simon Egerton (Main)
Dr Siew Eu-Gene (Joint)

[+/-]Project Title: Information Extraction in Web using Statistical and Visual Knowledge-Beyond Text Mining

The development of the Internet during the later half of the last century, and its subsequent exponential growth, played a defining role in the information revolution and established the Information Age that we all now inhabit.  The adoption of the Internet as a primary source of information, coupled with the geometrical growth of computing power, as predicted by Moore’s Law, has seen the amount and variety of information accessible through the Internet also grow exponentially, this exponential growth continues today and shows no sign yet of abating. We now have access to more information than ever before via the Internet, which raises the question, how do we find the information we want in this vast sea of information? Search engines, such as the ubiquitous Google, provide one solution. However, many search engines return additional, and often unnecessary, information, such as advertisements.  Moreover, the search may return incomplete data. Generally, users are only interested in retrieving the specific information they are searching for, and ideally they want that information to be as complete as possible.  In addition, users would like to find the required information simply and quickly, it becomes time consuming when the user has to manually search through pages of search results to locate the required information, in effect conducting a manual meta-search themselves.  The solution to this problem is Information Extraction, which can be used to automate this meta-search.  Methods for doing this are termed wrappers.  Wrappers search through and extract information from one or many web pages, collating relevant information.  Wrappers need to deal with semi-structured information and the many ambiguities of the HTML language.  Wrappers were initially constructed by hand to extract information from specific types of web page.  However, this method soon becomes impractical, virtually every type of pages needs its own wrapper, and the wrappers need maintaining and updating should their target web pages change their layout.  The solution of course is to automate the wrapper method.  Essentially, automatic wrappers attempt to find structure within the target web page and extract information accordingly, as data records, for example.  Early automatic wrappers used HTML tags to determine structure.  However, the accuracy and precession of these methods are affected by ambiguities in the HTML language, furthermore if the information in the web page is not uniformly presented.  To overcome these limitations wrappers used additional visual cues, such as context, in font, color, style, size and relative object positions with the web pages, pictures, for example. This research focuses on developing a new Information Extraction tool by combining statistical and visual knowledge, whereby the limitations in the current tools are minimized, notably in developing an automatic wrapper for extracting multiple record sections and providing ease of implementation that can be easily extendable in the future.

Doctor of Philosophy (PhD)

2) Name: Tan Choon Ling

Email: cltan4@student.monash.edu

Supervisors:
Dr Simon Egerton (Main)
Dr Velappa Ganapathy (Joint)

[+/-]Project Title: A Robust Real-Time Binocular Vision Feature Tracking Model for Autonomous Robots in SLAM

Tracking of features is crucial in areas such as computer vision and image processing. With the increased availability and affordability of web cameras, new features can potentially be detectable and utilized in manners that were not previously possible with other types of sensors. Therefore, we describe a model in which these potential features can be tracked both in an indoors and outdoors environment.  This feature tracking model is then applied to autonomous mobile robots to facilitate the process of Simultaneous Localization and Mapping (SLAM) in real-time. The implementation of feature tracking is intended to be done with binocular vision (two cameras) due to cheaper and smaller components.
In order to obtain sufficiently satisfying performances, many factors have to be considered. A robust and reliable world coordinate system is needed in order to allow the robot to track and traverse around real-world objects located in a given environment. The feature tracking ability of the cameras must not only be computationally inexpensive, but must also possess a high degree of accuracy. In order to compromise between computational complexity and acceptable results, it should also be known which types of features are allowed to be tracked, and using which specific algorithmic methods.


[+/-]Research Grants

Principal investigator

Project Title

Source

Dr Saadat Alhasmi

Formal Support for Data Modelling and Process Modelling in Halal Industry

Monash Sunway campus Seed Funding (2008)

Dr Saadat Alhashmi

Process Re-engineering to Counter the Problem of Non-Genuine Halal Products that are Sold in the Market

Monash Sunway campus Seed Funding (2008)

Dr Siew Eu-Gene

Classifying the Financial Health of Malaysian Listed Companies Using Data Mining Techniques

Monash Sunway campus Seed Funding (2008)

Dr Simon Egerton

Quantum Controllers for Co-operative Autonomous Bipedal Robots

Monash Sunway campus Seed Funding (2007)



2. Multimedia Computing, Communications and Applications Research
[+/-]Description

This area includes research in medical image processing, digital library, remote surveillance, network resource management, mobile communications, emerging technologies such as wireless sensor and mesh networks and innovative applications in learning, business, games, archaeology and art.


[+/-]Publications

See K.W., Loke K.S., Lee P.A. and Loe K.F. (2007), “Image Reconstruction using Various Discrete Orthogonal Polynomials in Comparison with DCT”, Journal of Applied Mathematics. Computation, Vol 193 pp. 346-359

See K.W., Loke K.S, Lee P.A. and Loe K.F. (2007), “Comparative Study of Image Reconstruction using Discrete Orthogonal Polynomials”, Conference on IT Research and Applications 2007 (CITRA 2007), 4-5 April, 2007, Kuala Lumpur, Malaysia.     

Lee K.M., Belkhatir M., Lee P.A., Sanei S. and Loe K.F. (2007), “Chaotic Characterisation of FRONTO-Normal Gait for Human Identification “, 15th European Signal Processing Conference EUSIPCO, 3-9 September 2007, Poznan, Poland.       

Belkhatir M. (2007), "Graph-Based Indexing and Querying on Image Corpora with Unified Visual Semantic and Relational Descriptions", International Conference on Asian Digital Libraries (ICADL 2007), 10-13 December, Hanoi, Vietnam      

Belkhatir M. (2007), “Coupling Visual Semantics and High-Level Relational Characterisation within a Transparent and Penetrable Image Retrieval Framework", International Conference on Tools with Artificial Intelligence (ICTAI), 29-31 October, Patras, Greece.           

Belkhatir M. and Charhad M. (2007), "A Conceptual Framework for Automatic Text-Based Indexing and Retrieval in Digital Video Collections", 18th International Conference on Database and Expert Systems Applications (DEXA 2007), 3-7 September, Regensburg, Germany   

Lee K.M., Loe K.F., Lee P.A. & Sanei S. (2007), “A Comparison of the Basic Temporal Features of Fronto-Normal and Fronto-Parallel Gait”, 2007 15th International Conference on Digital Signal Processing (DSP 2007), 1-4 July, Wales, United Kingdom.


[+/-]Staff

[+/-]Higher Degree by Research (HDR) students

1) Name:Lee Kah Mein

Email: kmlee8@student.monash.edu

Supervisors:
Dr Mohammed Belkhatir (Main)
Professor Lee Poh Aun (Joint)


[+/-]Project Title: Robust Combination of Biometrics for Human Identification in Video

Biometrics are used to identify people in a non-intrusive way and should be obtained in a timely and accurate fashion. A typical scenario for automated recognition of humans is when people have to queue up to access a facility. They traverse a passage to be identified where commonly a single camera is placed. This provides a fronto-normal view of a person. The physical space needed is small and the application is natural, practical and useful. Since the video stream provides face and gait images, these provide spatial and spatio-temporal features respectively. Face features which are short-range biometrics, can be naturally combined with gait, which is a medium-range biometric using pattern recognition techniques. This combination is a relatively new field of research and provides a more robust identification as well as reducing error rates in recognition. Video streams can also provide various other biometrics like eyes, nose, and static gait features like body height and width to help in the identification process. By itself, gait is a relatively new biometric. Obtaining features from a video stream of fronto-normal gait provides dynamic data which can be subject to Time Series Analysis. Tracking body features from this viewpoint can be challenging because of the looming effect and occlusions.

Face recognition is a relatively mature field. We will use existing techniques from this field to validate the effects of biometric classifier combination. Initial experiments have shown promising results. We have used nonlinear dynamical analysis on fronto-normal gait, to improve face recognition rates.

2) Name: Wan Fariza Paizi @ Fauzi

Email: wfpai1@student.monash.edu

Supervisors:
Dr Mohammed Belkhatir (Main)
Dr Saadat Alhashmi (Joint)


[+/-]Project Title: Integrating the Surrounding Image Information within a High-Level Conceptual Framework for Symbolic Image Indexing and Retrieval on the WWW
The key problem in image retrieval systems basically bogs down to getting relevant images. To get a relevant image, an image must of course be indexed correctly. With the rapidly growing numbers of digital images, manually indexing all images in an image corpus sounds extremely overwhelming. Manual indexing or annotation is not only tedious but it might have inconsistencies and is highly subjective. Another school of thought is that an image should be searched, based on its content and not on the annotated text. And yes, logically, we should get a better (i.e. a more relevant) search results and we did. Earlier content based systems were able to index images with the visual features, also termed as the signal properties, such as its colour, texture and shape. With these old systems, users would have to query based on colour, shape, or texture. Querying an image that has red colour and a square shape would give the result of a red box, a red card, part of a red car, or even a red house, and the list goes on. Hence, the result is relevant visually but not semantically (the user might have just wanted a red box). Then, methods emerge to map these low-level visual features to high level semantic concepts. For example, brown rectangular shape and green circle would be a tree. Users could now query tree. However, the mappings that need to be done, once again, sound overwhelming and the visual information is lost!  This brings us to the approach that integrates both visual features and semantic concepts. This approach complements the previous two approaches. Nevertheless, mapping of visual features to semantic concepts is also involved therefore it only manipulates restrained and fixed sets of semantic concepts. It is very difficult to elicit mappings between extracted visual features and big sets of semantic concepts. We propose a new model based on the last approach, a signal/semantic based conceptual framework, which takes into account the surrounding image information in web pages as an enrichment source for the indexing of images. Concepts and relations describing the image visual content are automatically extracted and then integrated in its description. We will target semantic concepts which are more specific (considering a hypernymy or is_a relationship) than the ones obtained through the first process of semantic-based characterization since we consider that they provide a richer and more refined description of the image content. Since the process of automatic image indexing is error-prone, another task will consist in correcting the potential erroneous semantic indexes introduced by the semantic indexing process. Experimentally, we will evaluate our proposal in the framework of the international INEX multimedia evaluation campaign.

3) Name: Bhawani Selvaretnam

Email: bss1@student.monash.edu

Supervisors:
Dr Saadat Alhashmi (Main)
Dr Mohammed Belkhatir (Joint)


[+/-]Project Title: A generic multi-agent framework for effective decision making within collaborative systems in dynamic environments
Agent-based computing is highly inter-disciplinary and it is utilized across multiple application domains such as telecommunications, manufacturing, information retrieval and management, transportation and logistics and many more.  Multi-agent systems are designed to work cooperatively to solve problems which may be beyond the capabilities of any individual agents. We aim to formulate a generic multi-agent framework for better decision making within collaborative systems in dynamic environments. There is an essential correlation between the architecture of a decision making multi-agent system and the environmental conditions it operates in. In a dynamic environment, this correlation must be managed through adaptive responses to the varying requirements of the system.  As such, we attempt to develop a multi agent planning methodology that formulates cooperative execution plans based on individual and joint goals of agents and devise a decision making model that would pave the way towards better learning and reasoning for making informed decisions, despite risks and uncertainties, within dynamic collaborative systems.

4) Name: Bashar Tahayna

Email: bmtah1@student.monash.edu

Supervisors:
Dr Mohammed Belkhatir (Main)
Dr Saadat Alhashmi (Joint)


[+/-]Project Title: Strongly Coupling Visual and Audio / Speech Characterizations within a Fully-Automated Conceptual Video Indexing and Retrieval Architecture: An Application to the TRECVID International Evaluation Campaign

Proliferation of video documents leads to an emerging demand for increasingly sophisticated and “intelligent” video indexing and retrieval systems. However, current solutions are still immature and lack of any widely-adopted standards. The indexing and accessing tasks require new architectures taking into account the several characteristics of video data, such as its size, rich content, as well as its visual, spatial and temporal features. As a matter of fact, addressing the problem of combining modalities for video indexing and retrieval is of huge importance and seen as the only solution for achieving significant retrieval performance. The PhD research investigates the specification of a multi-facetted conceptual framework based on an integrated multi-modal approach for video indexing and retrieval. It is based on combining multiple features extracted from the visual, audio and temporal modalities. It moreover relies on an expressive representation formalism handling high-level video descriptions and a full-text query framework in an attempt to retrieve video documents beyond systems relying on low-level features, keyword-annotation frameworks and state-of-the-art architectures loosely-coupling visual and auditory descriptions.


[+/-]Research Grants

Principal investigator

Project Title

Source

Dr Mohammed Belkhatir

On the use of symbolic signal information for concept-based indexing and retrieval of the visual multimedia content. An experimental validation on the image corpus of the international CLEF evaluation campaign

Monash Sunway campus Seed Funding (2008)

Dr Mohammed Belkhatir

A Conceptual Framework for Automatic Text-Based Indexing and Retrieval in Digital Video Collections. An Experimental Validation on the Multimedia Corpus of the International TRECVID Evaluation Campaign

Monash Sunway campus Seed Funding (2008)

Dr Mohammed Belkhatir

Boosting Intelligent Image Retrieval on the WWW. An experimental validation on the multimedia corpus of the international INEX evaluation campaign

Monash Sunway campus Seed Funding (2007)

Professor Lee Poh Aun

The Grid Cluster for Distributed Imaging Applications

ICT Project Fund

Professor Lee Poh Aun

Image Analysis Using Discrete Orthogonal Polynomials

ICT Project Fund


   

research