Автор |
Szepesvári, Csaba |
Автор |
Lórincz, Andràs |
Дата выпуска |
1993 |
dc.description |
A brain model-based alternative to reinforcement learning is presented that integrates artificial neural networks and knowledge-based systems into one unit or agent for goal-oriented problem solving. The agent may possess inherited and learned artificial neural networks and knowledge-based subsystems. The agent has and develops ANN cues to the environment for dimensionality reduction (data compression) to ease the problem of combinatorial explosion. Here, a dynamical concept model is put forward that builds cue models of the phenomena in the world, designs dynamical action sets (concepts), and makes them compete in a spreading-activation neural stage to reach decision. The agent works under closed-loop control. Here we examine a simple robotlike object in a two-dimensional conditionally probabilistic space. |
Издатель |
Sage Publications |
Тема |
adaptivity |
Тема |
artificial neural networks |
Тема |
knowledge-based system; self-organization |
Тема |
activation spreading |
Тема |
autonomous system |
Название |
Behavior of an Adaptive Self-organizing Autonomous Agent Working with Cues and Competing Concepts |
Тип |
Journal Article |
DOI |
10.1177/105971239300200202 |
Print ISSN |
1059-7123 |
Журнал |
Adaptive Behavior |
Том |
2 |
Первая страница |
131 |
Последняя страница |
160 |
Аффилиация |
Szepesvári, Csaba, Jànos Bolyai Institute of Mathematics |
Аффилиация |
Lórincz, Andràs, Hungarian Academy of Sciences |
Выпуск |
2 |
Библиографическая ссылка |
Agranat, A.J., Neugebauer, C.F., & Yariv, Y. (1990). The CCD neural processor: A neural network integrated circuit with 65536 programmable synapses. IEEE Transactions on Circuits and Systems, 37, 1073-1075. |
Библиографическая ссылка |
Agranat, A.J., & Yariv, Y. (1987). Semiparallel microelectronic implementation of neural network models using CCD technology. Electronics Letters , 23, 580-582. |
Библиографическая ссылка |
Anderson, C.W. (1987). Strategy learning with multilayer connectionist representation. In Proceedings of the Fourth International Workshop on Machine Learning, Ann Arbor, MI. |
Библиографическая ссылка |
Barto, A., Bradtke, S., & Singh, S. (1991). Real-time learning and control using asynchronous dynamic programming (Tech. Rep. No. 91-57). Boston: Computer Science Department, University of Massachusetts. |
Библиографическая ссылка |
Bundy, A. (Ed.). (1990). Catalogue of artificial intelligence techniques. (3rd ed.). Heidelberg : Springer-Verlag. |
Библиографическая ссылка |
Carpenter, G.A., & Grossberg, S.A. (1987). Massively parallel architecture for self-organizing neural pattern recognition machine. Computer Vision, Graphics, and Image Processing, 37, 54-115. |
Библиографическая ссылка |
Chapman, D., & Kaelbling, L.P. (1991). Input generalization in delayed reinforcement learning: An algorithm and performance comparisons. In Proceedings of the International Joint Conference on Artificial Intelligence, Sydney, Australia. |
Библиографическая ссылка |
Cloak, E.T., Jr. (1975). Is cultural ecology possible? Human Ecology, 3, 161-182. |
Библиографическая ссылка |
Collins, A., & Loftus, E. (1975). A spreading activation theory of semantic processing . Psychological Review, 82, 407-428. |
Библиографическая ссылка |
Csànyi, V. (1982). General theory of evolution. Budapest: Akadémiai Könyvkiadó. |
Библиографическая ссылка |
Földiák, P. (1991). Learning invariance from transformation sequences . Neural Computation, 3(2), 194-200. |
Библиографическая ссылка |
Fomin, T., & Lórincz, A. (1993). On the potential of Hebbian and anti-Hebbian learning . Manuscript submitted for publication. |
Библиографическая ссылка |
Fukushima, K. (1992). Character recognition with neural networks. Neural Computing, 4, 221-233. |
Библиографическая ссылка |
Grossberg, S.A. (1968). Some nonlinear networks capable of learning a spatial pattern of arbitrary complexity. Proceedings of the National Academy of Sciences, 59, 368-372. |
Библиографическая ссылка |
Hebb, D.O. (1949). The organization of behavior. New York: Wiley |
Библиографическая ссылка |
Hertz, J., Krogh, A., & Palmer, R.G. (1991). Introduction to the theory of neural computation . Redwood City: Addison-Wesley . |
Библиографическая ссылка |
Hornik, K., Stinchcombe, M., & White, H. (1989). Multilayer feedforward networks are universal approximators. Neural Networks, 2, 359-366. |
Библиографическая ссылка |
Huberman, B.A., & Hogg, T. (1987). Phase transition in artificial intelligence systems . Artificial Intelligence, 33, 155-171. |
Библиографическая ссылка |
Judd, J.S. (1990). Neural network design and the complexity of learning . Cambridge, MA: MIT Press. |
Библиографическая ссылка |
Korf, R.E. (1990). Real time heuristic search. Artificial Intelligence, 42, 189-211. |
Библиографическая ссылка |
Lin, L.-J. (1990). Self-improving reactive agents: Case studies of reinforcement learning framework. (Tech. Rep. No. CMU-CS-90-109). Pittsburgh: Carnegie-Mellon University . |
Библиографическая ссылка |
Maes, P. (1992). Learning behavior networks from experience. In F. J. Varela & P. Bourgine (Eds.), Toward a practice of autonomous systems: Proceedings of the First European Conference on Artificial Life. Cambridge, MA: MIT Press. |
Библиографическая ссылка |
Olshausen, B., Anderson, C., & Van Essen, D. (1993). A neural model of visual attention and invariant pattern recognition. Manuscript submitted for publication. |
Библиографическая ссылка |
Peng, Y., & Reggia, J.A. (1989). A connectionist model for diagnostic problem solving . IEEE Transactions on Systems, Man and Cybernetics, 19, 285-298. |
Библиографическая ссылка |
Sutton, R.S. (1988). Learning to predict by the method of temporal differences. Machine Learning, 3, 9-44. |
Библиографическая ссылка |
Szepesvàri, C., Balàzs, L., & Lôrincz, A. (in press). Topology learning solved by extended objects: A neural network model. Neural Computation. |
Библиографическая ссылка |
Thagard, P. (1989). Explanatory coherence. Behavioral and Brain Sciences, 12, 435-467. |
Библиографическая ссылка |
Varela, F. J., & Bourgine, P. (Eds.) (1992). Toward a practice of autonomous systems: Proceedings of the First European Conference on Artificial Life. Cambridge, MA: MIT Press. |
Библиографическая ссылка |
Watkins, C.J.C.H. (1989). Learning from delayed reward. Unpublished doctoral dissertation, Kings College, Cambridge , England. |
Библиографическая ссылка |
Widrow, B., Gupta, N.K., & Maitra, S. (1973). Punish/reward: Learning with critic in adaptive threshold systems. IEEE Transactions on Systems, Man and Cybernetics SMC-3, 455-465. |