Publication page

Multi-Agent Algorithm for Re-Allocating Grid-Resources and Improving Fault-Tolerance of Problem-Solving Processes

Authors: Feoktistov A., Kostromin R., Sidorov I., Gorsky S., Oparin G.

Journal: Procedia Computer Science: Proc. of the 13th Intern. Symposium on Intelligent Systems (INTELS’2018, St. Petersburg, October 22-24, 2018)

Volume: 150

Issue:

Year: 2019

Reporting year: 2019

Publisher: LETI

Publisher location: Saint Petersburg

URL:

Abstract: Ensuring the fault tolerance of computational processes in Grid environments is a pressing issue. In this paper, we address improving fault tolerance when solving large-scale scientific and applied problems that are implemented through modular programming in heterogeneous distributed computing environments. We describe a computational process by an abstract program (problem-solving scheme) that corresponds to a workflow. The problem-solving scheme specifies modules (applied software) and the relations between them. This paper proposes a new multi-agent algorithm for re-allocating Grid resources when the computational process fails. The algorithm forms a residual problem-solving scheme using methods of abstract program specialization and reallocates its modules between agents that represent computational resources. In comparison with known algorithms for the same purpose, the proposed algorithm implements an adaptive multi-scenario approach to this issue and therefore increases the degree of fault tolerance of the computational process. Extensive modeling and practical experiments demonstrate the practicability of the proposed algorithm.

Indexed in WOS: 0

Indexed in Scopus: 1

Indexed in the Russian Science Citation Index (RINC): 1

In press: 0

Added to the system by: