Behavior of an Adaptive Self-organizing Autonomous Agent Working with Cues and Competing Concepts

Szepesvári, Csaba; Lórincz, Andràs

Смотреть

Весь архив
Текущую коллекцию

Главная
Коллекции, полученные в рамках Государственного контракта №07.551.11.4002
Издательство SAGE Publications
Посмотреть элемент

Автор	Szepesvári, Csaba
Автор	Lórincz, Andràs
Дата выпуска	1993
dc.description	A brain model-based alternative to reinforcement learning is presented that integrates artificial neural networks and knowledge-based systems into one unit or agent for goal-oriented problem solving. The agent may possess inherited and learned artificial neural networks and knowledge-based subsystems. The agent has and develops ANN cues to the environment for dimensionality reduction (data compression) to ease the problem of combinatorial explosion. Here, a dynamical concept model is put forward that builds cue models of the phenomena in the world, designs dynamical action sets (concepts), and makes them compete in a spreading-activation neural stage to reach decision. The agent works under closed-loop control. Here we examine a simple robotlike object in a two-dimensional conditionally probabilistic space.
Издатель	Sage Publications
Тема	adaptivity
Тема	artificial neural networks
Тема	knowledge-based system; self-organization
Тема	activation spreading
Тема	autonomous system
Название	Behavior of an Adaptive Self-organizing Autonomous Agent Working with Cues and Competing Concepts
Тип	Journal Article
DOI	10.1177/105971239300200202
Print ISSN	1059-7123
Журнал	Adaptive Behavior
Том	2
Первая страница	131
Последняя страница	160
Аффилиация	Szepesvári, Csaba, Jànos Bolyai Institute of Mathematics
Аффилиация	Lórincz, Andràs, Hungarian Academy of Sciences
Выпуск	2
Библиографическая ссылка	Agranat, A.J., Neugebauer, C.F., & Yariv, Y. (1990). The CCD neural processor: A neural network integrated circuit with 65536 programmable synapses. IEEE Transactions on Circuits and Systems, 37, 1073-1075.
Библиографическая ссылка	Agranat, A.J., & Yariv, Y. (1987). Semiparallel microelectronic implementation of neural network models using CCD technology. Electronics Letters , 23, 580-582.
Библиографическая ссылка	Anderson, C.W. (1987). Strategy learning with multilayer connectionist representation. In Proceedings of the Fourth International Workshop on Machine Learning, Ann Arbor, MI.
Библиографическая ссылка	Barto, A., Bradtke, S., & Singh, S. (1991). Real-time learning and control using asynchronous dynamic programming (Tech. Rep. No. 91-57). Boston: Computer Science Department, University of Massachusetts.
Библиографическая ссылка	Bundy, A. (Ed.). (1990). Catalogue of artificial intelligence techniques. (3rd ed.). Heidelberg : Springer-Verlag.
Библиографическая ссылка	Carpenter, G.A., & Grossberg, S.A. (1987). Massively parallel architecture for self-organizing neural pattern recognition machine. Computer Vision, Graphics, and Image Processing, 37, 54-115.
Библиографическая ссылка	Chapman, D., & Kaelbling, L.P. (1991). Input generalization in delayed reinforcement learning: An algorithm and performance comparisons. In Proceedings of the International Joint Conference on Artificial Intelligence, Sydney, Australia.
Библиографическая ссылка	Cloak, E.T., Jr. (1975). Is cultural ecology possible? Human Ecology, 3, 161-182.
Библиографическая ссылка	Collins, A., & Loftus, E. (1975). A spreading activation theory of semantic processing . Psychological Review, 82, 407-428.
Библиографическая ссылка	Csànyi, V. (1982). General theory of evolution. Budapest: Akadémiai Könyvkiadó.
Библиографическая ссылка	Földiák, P. (1991). Learning invariance from transformation sequences . Neural Computation, 3(2), 194-200.
Библиографическая ссылка	Fomin, T., & Lórincz, A. (1993). On the potential of Hebbian and anti-Hebbian learning . Manuscript submitted for publication.
Библиографическая ссылка	Fukushima, K. (1992). Character recognition with neural networks. Neural Computing, 4, 221-233.
Библиографическая ссылка	Grossberg, S.A. (1968). Some nonlinear networks capable of learning a spatial pattern of arbitrary complexity. Proceedings of the National Academy of Sciences, 59, 368-372.
Библиографическая ссылка	Hebb, D.O. (1949). The organization of behavior. New York: Wiley
Библиографическая ссылка	Hertz, J., Krogh, A., & Palmer, R.G. (1991). Introduction to the theory of neural computation . Redwood City: Addison-Wesley .
Библиографическая ссылка	Hornik, K., Stinchcombe, M., & White, H. (1989). Multilayer feedforward networks are universal approximators. Neural Networks, 2, 359-366.
Библиографическая ссылка	Huberman, B.A., & Hogg, T. (1987). Phase transition in artificial intelligence systems . Artificial Intelligence, 33, 155-171.
Библиографическая ссылка	Judd, J.S. (1990). Neural network design and the complexity of learning . Cambridge, MA: MIT Press.
Библиографическая ссылка	Korf, R.E. (1990). Real time heuristic search. Artificial Intelligence, 42, 189-211.
Библиографическая ссылка	Lin, L.-J. (1990). Self-improving reactive agents: Case studies of reinforcement learning framework. (Tech. Rep. No. CMU-CS-90-109). Pittsburgh: Carnegie-Mellon University .
Библиографическая ссылка	Maes, P. (1992). Learning behavior networks from experience. In F. J. Varela & P. Bourgine (Eds.), Toward a practice of autonomous systems: Proceedings of the First European Conference on Artificial Life. Cambridge, MA: MIT Press.
Библиографическая ссылка	Olshausen, B., Anderson, C., & Van Essen, D. (1993). A neural model of visual attention and invariant pattern recognition. Manuscript submitted for publication.
Библиографическая ссылка	Peng, Y., & Reggia, J.A. (1989). A connectionist model for diagnostic problem solving . IEEE Transactions on Systems, Man and Cybernetics, 19, 285-298.
Библиографическая ссылка	Sutton, R.S. (1988). Learning to predict by the method of temporal differences. Machine Learning, 3, 9-44.
Библиографическая ссылка	Szepesvàri, C., Balàzs, L., & Lôrincz, A. (in press). Topology learning solved by extended objects: A neural network model. Neural Computation.
Библиографическая ссылка	Thagard, P. (1989). Explanatory coherence. Behavioral and Brain Sciences, 12, 435-467.
Библиографическая ссылка	Varela, F. J., & Bourgine, P. (Eds.) (1992). Toward a practice of autonomous systems: Proceedings of the First European Conference on Artificial Life. Cambridge, MA: MIT Press.
Библиографическая ссылка	Watkins, C.J.C.H. (1989). Learning from delayed reward. Unpublished doctoral dissertation, Kings College, Cambridge , England.
Библиографическая ссылка	Widrow, B., Gupta, N.K., & Maitra, S. (1973). Punish/reward: Learning with critic in adaptive threshold systems. IEEE Transactions on Systems, Man and Cybernetics SMC-3, 455-465.

Читать

1.464Мб

Скрыть метаданые

Смотреть

Весь архив

Текущую коллекцию