The batch primary components transformer and auto-plasticity learning linear units architecture: synthetic image generation case
dc.contributor.author | Selitskiy, Stanislav | |
dc.contributor.author | Inoue, Chihiro | |
dc.contributor.author | Schetinin, Vitaly | |
dc.contributor.author | Jakaite, Livija | |
dc.date.accessioned | 2024-02-02T10:24:05Z | |
dc.date.available | 2024-02-02T10:24:05Z | |
dc.date.issued | 2024-01-02 | |
dc.identifier.citation | Selitskiy S, Inoue C, Schetinin V, Jakaite L (2024) 'The batch primary components transformer and auto-plasticity learning linear units architecture: synthetic image generation case', 2023 Tenth International Conference on Social Networks Analysis, Management and Security (SNAMS) - Abu Dhabi, IEEE. | en_US |
dc.identifier.doi | 10.1109/SNAMS60348.2023.10375471 | |
dc.identifier.uri | http://hdl.handle.net/10547/626147 | |
dc.description.abstract | Context tokenizing, which is popular in Large Language and Foundation Models (LLM, FM), leads to their excessive dimensionality inflation. Traditional Transformer models strive to reduce intractable excessive dimensionality at the among-token attention level, while we propose additional between-dimensions attention mechanism for dimensionality reduction. A novel Transformer-based architecture is presented, which aims at the individual dimension attention and, by doing so, performs the implicit relevant primary components' feature selection in artificial neural networks (ANN). As an additional mechanism allowing adaptive plasticity learning in ANN, a neuron-specific Learning Rectified Linear Unit layer is proposed for further feature selection via weight decay. The performance of the presented layers is tested on the encoder-decoder architecture applied for the synthetic image generation task for the benchmark MNIST data set. | en_US |
dc.language.iso | en | en_US |
dc.publisher | IEEE | en_US |
dc.relation.url | https://ieeexplore.ieee.org/document/10375471 | en_US |
dc.subject | transformer | en_US |
dc.subject | feature selection | en_US |
dc.subject | cosine distance | en_US |
dc.subject | ANN plasticity | en_US |
dc.subject | catastrophic forgetting | en_US |
dc.subject | learning ReLU | en_US |
dc.title | The batch primary components transformer and auto-plasticity learning linear units architecture: synthetic image generation case | en_US |
dc.type | Conference papers, meetings and proceedings | en_US |
dc.contributor.department | University of Bedfordshire | en_US |
dc.date.updated | 2024-02-02T10:21:19Z | |
dc.description.note |