Asymmetrical estimator for training encapsulated deep photonic neural networks

Wang, Yizhi; Chen, Minjia; Yao, Chunhui; Ma, Jie; Yan, Ting; Penty, Richard; Cheng, Qixiang

Computer Science > Machine Learning

arXiv:2405.18458 (cs)

[Submitted on 28 May 2024 (v1), last revised 17 Nov 2024 (this version, v3)]

Title:Asymmetrical estimator for training encapsulated deep photonic neural networks

Authors:Yizhi Wang, Minjia Chen, Chunhui Yao, Jie Ma, Ting Yan, Richard Penty, Qixiang Cheng

View PDF

Abstract:Photonic neural networks (PNNs) are fast in-propagation and high bandwidth paradigms that aim to popularize reproducible NN acceleration with higher efficiency and lower cost. However, the training of PNN is known to be a challenge, where the device-to-device and system-to-system variations create imperfect knowledge of the PNN. Despite backpropagation (BP)-based training algorithms often being the industry standard for their robustness, generality, and fast gradient convergence for digital training, existing PNN-BP methods rely heavily on the accurate intermediate state extraction for a deep PNN (DPNN). These information accesses truncate the photonic signal propagation, bottlenecking DPNN's operation speed and increasing the system construction cost. Here, we introduce the asymmetrical training (AT) method, tailored for encapsulated DPNNs, where the signal is preserved in the analogue photonic domain for the entire structure. AT's minimum information readout for training bypasses analogue-digital interfaces wherever possible for fast operation and minimum system footprint. AT's error tolerance and generality aim to promote PNN acceleration in a widened operational scenario despite the fabrication variations and imperfect controls. We demonstrated AT for encapsulated DPNN with integrated photonic chips, repeatably enhancing the performance from in-silico BP for different network structures and datasets.

Comments:	22 pages, 6 figures
Subjects:	Machine Learning (cs.LG); Optics (physics.optics)
MSC classes:	78-05
Cite as:	arXiv:2405.18458 [cs.LG]
	(or arXiv:2405.18458v3 [cs.LG] for this version)
	https://doi.org/10.48550/arXiv.2405.18458

Submission history

From: Yizhi Wang [view email]
[v1] Tue, 28 May 2024 17:27:20 UTC (1,386 KB)
[v2] Thu, 15 Aug 2024 10:58:17 UTC (1,788 KB)
[v3] Sun, 17 Nov 2024 12:33:25 UTC (1,749 KB)

Computer Science > Machine Learning

Title:Asymmetrical estimator for training encapsulated deep photonic neural networks

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Machine Learning

Title:Asymmetrical estimator for training encapsulated deep photonic neural networks

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators