Information Area:
This special issue is organized by Dr. Jehad Mohamad ALJA’AM with the collaboration of Science Pub. New York

Indexed in:
Ulrich
DOAJ
Cabell
WAD
ASA
INSPEC
IEE
Genamics





Efficient Heuristics for Information Organization ...

Serving the Industry
Incremental, dynamic, and automatic information organization should be made available in most information retrieval systems or search engines to shorten the browsing process. Information organization problem, mainly a classification process is NP-complete. As a matter of fact, algorithms behind information structuring are not efficient for huge amount of data. A challenging question is to make faster heuristics for this task, to design a realistic system for browsing through huge database or by the Internet. The problem is not to make faster search engines, but to filter huge arriving data from the Internet into a structured search space. Articles included original research papers presenting the kind of proposed data structure and how it will decrease browsing and improve the quality of searching process by the Internet or through databases. Papers discussed different related sides of the problem: algorithmic, linguistic, ontological or any other useful area. The use of conceptual approaches or formal based methods was recommended.


Scope of the Special issue:

  • Automatic information structuring
  • Search browsing
  • Information filtering and reduction
  • efficient heuristics for NP-complete problems

Title:
Text Clustering for Natural Language Applications
Author(s):
SYllias Chali and Soufiane Noureddine
Source: Journal of computer Science: 1-7
Abstract: Text clustering has many uses in natural language tools and applications. For instance, summarizing sets of documents that all describe the same event requires first identifying and grouping those documents talking about the same event. Text clustering involves dividing a set of texts into
non-overlapping clusters. In this study, we present two text clustering algorithms: Grouping Algorithm and Chaining Algorithm. We compared them with k-means and the EM algorithms. The evaluation results showed that our two algorithms perform better than the k-means and EM algorithms in different experiments.
 
Title: Efficient Simulator Based on Meta-Heuristic forFMS and AGV Systems Design and Control
Author(s):
Slim Ben Saoud, Amel Jaoua and Narjčs Bellamine-Ben Saoud
Source: Journal of computer Science :8-14
Abstract:
Flexible Manufacturing Systems (FMS) based on Automated Guided Vehicles (AGV) have emerged as highly competitive manufacturing technologies of the last decade. However, their design and control are complex tasks since the decision problems related to the different system parameters such as sizing and scheduling are usually shown as NP-hard. Nowadays, computer simulation is one of the most commonly used methods for solving these problems. Now, we present a complete simulation tool which allows the design, analysis and control of FMS based on AGV. This tool is composed of three modules: (1) A simulation module which allows the estimation of the user introduced configuration. (2) An optimization module which is based on a meta-heuristic: the simulated annealing. This module is coupled with the simulation one in order to obtain an Executive Information System that is able to generate an optimal configuration. (3) A reactive control system which is based on the concept of real-time simulation and allows dynamic dispatching and routing of different systems inside the global FMS. The different developed modules have been validated by using a typical automated manufacturing system.
 
Title:
An Approach for the Satisfiability Problem via Exterior Penalty Optimization
Author(s): Soufiane Noureddine
Source: Journal of computer Science : 15-20
Abstract:
We present an iterative algorithm that promises to achieve a satisfactory solution for the central problem of computational logic and complexity theory. Clear evidence is given that the algorithm, in some circumstances, should perform well in practice though its implementation is still lacking. The method we use is optimization. The particular version of satisfiability we focus on is the exact satisfiability problem (XSAT), which is known to be NP-complete. The study presents the detailed algorithm and discusses correctness and efficiency issues.
 
Title:
An Agent-Based Testbed for Simulating Large Scale Accident Rescue Heuristics
Author(s): Narjčs Bellamine-Ben Saoud, Bernard Pavard, Julie Dugdale, Tarek Ben Mena and Mohamed Ben Ahmed
Source: Journal of computer Science : 21-26
Abstract:
The scope of this study is to present our study of the complex social problem of large scale accident rescue by applying an agent-based approach. Our field of study concerns situations involving a large number of victims over a wide area (which may or not be hostile) and where rescuers have to act rapidly to rescue the greatest number of victims in the shortest time by optimizing both their human and material resources. Based on real life observations and rescue plans on one side and designing new rescuing strategies on the other side we have built a generic and interactive user-friendly simulator. Modeling and simulation provide us with a virtual environment where we can easily develop and test a large number of “what-if” heuristic scenarios of different rescue organizations. These organizations may be compared and assessed in order to find efficient configurations and strategies for organizing a rescue.
 
Title:
Document Clustering Analysis Based on Hybrid PSO+K-means Algorithm
Author(s):
Xiaohui Cui and Thomas E. Potok
Source: Journal of computer Science : 27-33
Abstract:
There is a tremendous proliferation in the amount of information available on the largest shared information source, the World Wide Web. Fast and high-quality document clustering algorithms play an important role in helping users to effectively navigate, summarize and organize the information. Recent studies have shown that partitional clustering algorithms are more suitable for clustering large datasets. The K-means algorithm is the most commonly used partitional clustering algorithm because it can be easily implemented and is the most efficient one in terms of the execution time. The major problem with this algorithm is that it is sensitive to the selection of the initial partition and may converge to a local optima. In this study, we present a hybrid Particle Swarm Optimization (PSO)+K-means document clustering algorithm that performs fast document clustering and can avoid being trapped in a local optimal solution as well. For comparison purpose, we applied the PSO+K-means, PSO, K-means and other two hybrid clustering algorithms on four different text document datasets. The number of documents in the datasets range from 204 to over 800 and the number of terms range from over 5000 to over 7000. The results illustrate that the PSO+K-means algorithm can generate the most compact clustering results than other four algorithms.
 
Title:
A New Metric for Geometric Model Based Cache Invalidation of Location Dependent Data in Mobile Environment
Author(s): Ajey Kumar, Manoj Misra and A.K. Sarje
Source: Journal of computer Science : 34-40
Abstract:
Mobile computing as compared to traditional computing paradigms enables clients to have unrestricted mobility while maintaining network connections. Data management in this paradigm poses new challenging problems to the data base community. Location Dependent Information Services (LDIS) is an emergent application in this area where information provided to users depends on their current locations. Data caching at mobile clients play a key role in data management due to its ability to improve system performance and overcome availability limitations. Spatial data cached in the mobile clients may become invalid because of the movement of the client. Cache Invalidation schemes aims to keep data consistency between the client’s cache and the server. To maintain consistency of LDD in cache, valid scope of that data item is identified and stored along with it in the client’s cache. In this study, we focus on the selection procedure of finding best suitable candidate for valid scope (i.e., best suitable sub polygon of a given polygon) and propose a generalized algorithm which selects the best suitable candidate for valid scope. We compare its performance with the existing algorithms. More over, we also introduce a new metric FA and an algorithm CEFAB which tries to improve the performance by considering the user movement pattern and speculation about its future access.
 
Title:
A Heuristic Reduct Computation Approach by Attributes Weighting for Rough Set Based Classification
Author(s): Qasem A. Al-Radaideh, Md Nasir Sulaiman, Mohd Hasan Selamat and Hamidah Ibrahim
Source: Journal of computer Science : 41-47
Abstract: Rough set theory is an elegant theory for knowledge discovery and it is mainly used in the classification and knowledge reduction tasks. The theory provides the reduct and core concepts for knowledge reduction. The cost of reduct set computation is highly influenced by the attribute set size of the dataset where the problem of finding reducts has been proven as NP-hard problem. Therefore, several optimization and approximation techniques have been proposed to generate reducts. This paper proposes an approximate heuristic approach for reduct generation, which is particularly used for classification purposes. The approach uses the discernibility matrix concept and rough set based attribute weighting mechanisms in different levels of the matrix. Three weights are proposed to determine the significance of an attribute to be considered in the reduct and to break the tie when several attributes have the same significance. The approach is extensively experimented and evaluated on various standard domains.
 
Title:
Integration Techniques to Build a Data Warehouse using Heterogeneous Data Sources
Author(s): F. Boufares and S. Hamdoun
Source: Journal of computer Science : 48-55
Abstract:
This work describes the construction of a data warehouse by the integration of heterogeneous relational, object-relational and XML data (complex data). In fact, developing intelligent tools for the integration of information extracted from multiple heterogeneous sources is a challenging issue to effectively exploit the numerous sources available in global information systems. Due to the heterogeneity of the sources, various languages of interrogation and different data models are used for the warehouses. Thus, the construction of the latters can be reached by several manners. Our work is based on the extraction of the inter-schema relationships between the sources. Related to this, a global schema was generated and the views of the data warehouse were constructed. All these stages, proposed in this work were implemented by the use of a functional prototype.
 
Title: Data Mining Meets Evolutionary Computation: A New Framework for Dynamic and Scalable Evolutionary Data Mining based on Non-Stationary Function Optimization
Author(s):
M. Haydari, M. M. Moksin, N. Yahya, W. M. M. Yunus and V. I. Grozescu
Source: Journal of computer Science : 56-63
Abstract:
Data mining has recently attracted attention as a set of efficient techniques that can discover patterns from huge data. More recent advancements in collecting massive evolving data streams created a crucial need for dynamic data mining. In this paper, we present a genetic algorithm based on a new representation mechanism that allows several phenotypes to be simultaneously expressed to different degrees in the same chromosome. This gradual multiple expression mechanism can offer a simple model for a multiploid representation with self-adaptive dominance, including co-dominance and incomplete dominance. Based on this model, we also propose a data mining approach that considers the data as a reflection of a dynamic environment and investigate a new evolutionary approach based on continuously mining non-stationary data sources that do not fit in main memory. Preliminary experiments are performed on real Web clickstream data.
 
Title: Modelling the Timetabling Problem Using Goal Programming
Author(s): Jihad Mohamad Jaam, Mohamed Larbi Rebaiaia and Ahmad Mojahid Hasnah
Source: Journal of computer Science : 64-71
Abstract: The school timetabling problem is a well known combinatorial hard problem. It consists mainly of allocating timeslots to lectures from a limited number of available timeslots taking into consideration the number of teachers and classrooms. Many other soft constraints may also be present, like teachers wishes, distances between different classrooms, break between lectures, classrooms locations, etc. However, a solution can be accepted without satisfying all the soft constraints and an optimal solution is obtained whenever all the pre-determined constraints are satisfied. Many heuristic algorithms and modelling approaches have been proposed to solve the timetabling problem. However, they deal with particular instances of the problem and no general solution can be found for all instances. In this research we propose a new multi-objective model for the timetabling problem using goal programming. We show that our model is very flexible and many other soft constraints related to the timetabling problem can be added easily to the model. Our experiments in solving the proposed model show that the obtained results are very promising.