Electrically programmable phase-change photonic

Full text

Turn on search term navigation

1. Introduction

In recent years, neural networks based on central processing units (CPUs) have been used in mobile phones for speech recognition and image classification,¹ but they are still in their infancy in more sophisticated and expansive application fields where massive amounts of data should be processed in real time, such as autonomous driving² and computer vision.³ Optical neural networks (ONNs) based on photonic integrated circuits (PICs)⁴^–⁹ have the potential to meet this demand as a consequence of their low latency, high parallel (e.g., wavelength/spatial division multiplexing), and strong anti-electromagnetic interference capability of PICs, as well as the low cost and high yield provided by a complementary metal-oxide-semiconductor (CMOS) fabrication process.¹⁰^–¹³ Recently, a series of ONNs have been demonstrated for artificial intelligence, including vowel recognition,¹⁴ perceptron,¹⁵^,¹⁶ pattern recognition,¹⁷ and image classification.¹⁸^,¹⁹ However, for real-world applications, more efforts are needed to improve the energy efficiency, scalability, and algorithm accuracy of ONNs.

In on-chip ONNs, weights are determined by basic units of PICs altering their optical phase²⁰ or intensity.²¹ These basic units commonly employ the thermo-optic (TO) effect, free-carrier dispersion effect, or nano-opto-electromechanical systems,²²^,²³ suffering from severe heat accumulation, high static power consumption or/and large footprint, which constrains the scalability of programmable photonic networks. On-chip integrated photonic memories, which can retain specific optical states after training (referring to all types of training), are anticipated to be embedded in programmable PICs to reduce or even eliminate static power consumption. Chalcogenide phase-change materials (PCMs) are promising candidates for zero static power-consumption photonic memories due to their reversible amorphous-crystalline phase transition,²⁴^–²⁶ and exceptional long-term, self-sustaining capability.²⁷ Moreover, the high optical contrast ([Formula omitted. See PDF]) of PCMs between their covalent-bonded amorphous and resonant-bonded crystalline states makes ultracompact photonic memories achievable. Compared with photonic memories based on charge trapping,²⁸ and ferroelectric domain configuration,²⁹ or programmable nodes of PICs based on latched micromechanical systems,³⁰ photonic memories and nonvolatile PICs based on PCMs have the advantages of high stability, low loss, and especially small footprint. In the past decade, PCM-based integrated photonic memory (PM) has been demonstrated by adopting GeSbTe,³¹^–³⁶ GeSbSeTe,³⁷ SbS,³⁸ SbSe,³⁹^,⁴⁰ etc. On-chip light-induced reconfigurable GST-based PM and its application in an ONN have been demonstrated.⁴¹ However, for low-loss PCMs such as SbSe, optically induced reprogramming is inapplicable for scalable networks due to the negligible absorption loss at the telecom C-band. Electrothermal control of PCM not only addresses this issue but also has the potential for constructing large-scale nonvolatile programmable PICs. This makes electrically programmable PCM-based PICs much coveted in the future of high-efficiency and large-scale ONNs.⁴²

On the other hand, in situ training (referring to training the ONN directly in the optical domain) is a potent remedy for enhancing the accuracy of algorithm execution in integrated ONNs,⁴³^–⁴⁵ which can not only improve the training speed but also reduce the influence of manufacturing errors and electrical/thermal cross talk.⁴⁶ However, although PCM-integrated photonic memories can make PICs highly energy-efficient after training, their long switching time and high switching energy consumption make them unsuitable for in situ training of ONNs, which hampers more accurate algorithm operation. Hence, an energy-efficient PM that could achieve high-speed volatile modulation at the same time is not only necessary but also pivotal, especially for in situ training of sporadic reprogramming ONNs exemplified by convolutional neural networks (CNNs).

Wavelength division multiplexing (WDM)-based computing is a potential arena for implementing optical CNNs.⁴⁷ Combined with the nonvolatile modulation of PCM, zero-static power consumption optical CNNs can be achievable.⁴⁸ Moreover, the combination of WDM and frequency comb makes ONNs with more complex functionality achievable.⁴⁹ Increasing the number of WDM channels can increase the amount of parallel computation of optical computing. The [Formula omitted. See PDF] waveband is a promising candidate for expanding the number of channels thanks to the ignorable two-photon absorption at the [Formula omitted. See PDF] waveband of silicon⁵⁰ and the higher free-carrier dispersion effect of silicon at [Formula omitted. See PDF].⁵¹

To date, to the best of our knowledge, nanosecond in situ training-compatible multilevel PM has not yet been studied. Here, we address these challenges by demonstrating an electrically programmable phase-change PM for ONNs. In this work, by integrating a low-loss PCM [Formula omitted. See PDF] with a p–i–n (PIN)-diode-embedded micro-ring resonator (MRR), a [Formula omitted. See PDF] multilevel PM with more than 5 bits was demonstrated, and any specific intermediate optical state can be configured from an unknown state by applying certain electrical pulses. Meanwhile, volatile modulation with a speed of 15.2 MHz was enabled by keeping the driving voltage of the waveguide-integrated PIN diode under the threshold for triggering the phase change of the PCM. Such photonic memories can simultaneously realize in situ training and data storage in PICs for ONNs. In addition, this work provides a new paradigm for constructing CMOS-compatible, electrically programmable, nonvolatile on-chip photonic accelerators with high-speed in situ training capability, which we believe would contribute to the further development of energy-efficient, large-scale, high-yield ONNs.

2. Device Design

Figure 1(a) shows a schematic diagram of the PM enabling in situ training of ONNs. A patch of [Formula omitted. See PDF] phase-change thin film with a thickness of 30 nm was covered on a 600 nm-wide 150 nm-etched silicon waveguide, forming a low-loss [Formula omitted. See PDF] hybrid waveguide configuration similar to what we previously demonstrated in Ref. 52. When the phase transition of [Formula omitted. See PDF] occurs, it modifies the refractive index of the PCM patch and the effective refractive index ([Formula omitted. See PDF]) of the hybrid waveguide, which alters the resonant peak of the microring, thus changing the optical output of the PM. A 30 nm-thick [Formula omitted. See PDF] film was capped on the top to avoid oxidization of [Formula omitted. See PDF] during phase switching. A PIN diode was embedded in the silicon waveguide to not only support fast volatile modulation but also induce phase transition of the PCM above the waveguide by resistive heating.

Figure 1(b) depicts how PCM-integrated photonic memories operate in on-chip ONNs. Before the in situ training began, PCM patches of the photonic memories in an ONN were all initialized to the crystalline state. This was achieved by heating the PCM up via the PIN diode to a temperature higher than its crystallization temperature ([Formula omitted. See PDF]) and holding for a period of time, for instance, 1 ms. During in situ training, the PIN diode in each PM was driven by a relatively low driving voltage, realizing the free-carrier dispersion effect-based volatile modulating, thus updating the weight in nanoseconds while keeping the temperature of the PCM below its crystallization temperature ([Formula omitted. See PDF]). After in situ training, the trained weight information from volatile modulation was written into PCM-integrated memories by the ohmic heating effect of the PIN diode. To realize multibit memory, PCM was melted and then rapidly quenched, further heated to various temperatures between [Formula omitted. See PDF] and [Formula omitted. See PDF] (melting temperature) to partially crystallize to a certain optical state. After weights are written into PM, the on-chip ONN can compute passively, i.e., maintaining the weight info without power consumption.

The design of the PIN microheater is the key to the PCM-integrated PM. Since we employed standard concentrations of ion implantation in a multiproject wafer (MPW) run offered by the Institute of Microelectronics of the Chinese Academy of Sciences (IMCAS), the distance between the [Formula omitted. See PDF] heavily doping area and waveguide core was designed to balance the insertion loss and heating efficiency. The propagation loss of our PIN-diode-embedded waveguide is simulated to be [Formula omitted. See PDF] and experimentally measured to be [Formula omitted. See PDF] (see Sec. S1 in the Supplementary Material). Figure 1(a) shows the distribution of the thermal field in the PM when a 6 V/500 ns voltage pulse is applied. It could be seen that the PIN diode can effectively heat the PCM up to a certain temperature and induce a corresponding phase change by applying specific electrical pulses.

To separately manipulate volatile modulation and nonvolatile storage, electric pulses needed to be studied. According to our simulation, the bias current applied for fast volatile modulation based on the free-carrier dispersion effect should be lower than 5.84 mA to avoid the TO effect (see Sec. S2 in the Supplementary Material). At this point, the temperature of the whole waveguide region was simulated to be lower than 355 K, far below the crystallization temperature of [Formula omitted. See PDF].

To write data to the PM, the driving voltage and pulse duration are the main parameters that need to be carefully designed and optimized. The longest pulse duration (or switching speed) of a PCM-based PM is limited by the crystallization process. Figure 1(c) shows the crystallization temperature of an [Formula omitted. See PDF] patch on the PIN diode with applied single pulses of different voltages and durations. It could be seen that the pulse duration needed for crystallization could be shortened by appropriately increasing the driving voltage, considering that the driving voltage required for crystallization is relatively low. In contrast, the highest driving voltage needed for a PCM-based PM depends on the amorphization process due to higher [Formula omitted. See PDF] than [Formula omitted. See PDF], as shown in Fig. 1(d). However, the voltage of the amorphization pulse cannot be arbitrarily lowered by prolonging the pulse duration. On the one hand, the thermal decay rate of the system has to be larger than the critical cooling rate⁵³ to avoid recrystallization, yet the thermal decay time of the system is simulated to increase with the prolonged pulse duration. On the other hand, continuous increasing of the pulse duration with a certain voltage amplitude ultimately leads to thermal saturation, and an overlong pulse duration brings about limited benefits. Hence, the duration of amorphization pulses is limited to within [Formula omitted. See PDF] in our design. It could be seen from Fig. 1(d) that the driving voltage could be optimized down to 5 V theoretically. This driving voltage could be supplied by integrated circuits in standard CMOS technologies.⁵⁴

Therefore, this PCM-integrated PM could potentially achieve a nonvolatile write speed of microseconds and write voltage lower than 5 V, as well as volatile modulation with nanoseconds for in situ training for ONNs. Although the optical loss in volatile phase modulation of a PIN diode is higher than that of a p–i–p (PIP) or n–i–n (NIN) doping waveguide,⁵⁵^,⁵⁶ it has prominent advantages of higher speed for volatile modulation due to the usage of the free-carrier dispersion effect of silicon rather than the TO effect of silicon. Moreover, the optical loss induced during volatile modulation becomes exploitable by integrating such a design with an MRR. Finally, the PIN diode microheater can reduce the driving voltage needed for phase switching of PCM compared to the PIP or NIN doping profile.³¹^,⁴⁰

3. Multibit Low-Loss Photonic Memory

We experimentally demonstrated the [Formula omitted. See PDF]-integrated PM in the form of an all-pass MRR ([Formula omitted. See PDF] MRR). Figure 2(a) shows a schematic diagram of the fabrication process. The waveguide patterning and ion implantation were performed in an MPW run offered by IMCAS. The doping concentrations of p-type and n-type were [Formula omitted. See PDF] and [Formula omitted. See PDF], respectively. Then, metallic electrodes (5 nm Cr/100 nm Au) and [Formula omitted. See PDF] patches were fabricated by UV lithography followed by a lift-off process. Finally, a 30 nm [Formula omitted. See PDF] was deposited, and the metal contact window was opened by etching.

Figure 2(b) shows an optical microscope image of the fabricated [Formula omitted. See PDF] MRR with a radius of [Formula omitted. See PDF]. A [Formula omitted. See PDF]-long [Formula omitted. See PDF] patch was covered on a [Formula omitted. See PDF]-long PIN diode embedded in the resonator. A home-built integrated photonic measurement setup (see Sec. S3 in the Supplementary Material) was used to characterize the PM. To eliminate the temperature perturbation derived from ambient temperature variation, the temperature of the substrate of the photonic chip is held to 30°C throughout the test via a temperature control system. Figure 2(c) shows the change of normalized transmittance ([Formula omitted. See PDF]) spectra of the [Formula omitted. See PDF] MRR when the phase transition of [Formula omitted. See PDF] occurs. When [Formula omitted. See PDF] was crystallized by a 3.0 V/1 ms voltage pulse or amorphized by an 8.2 V/500 ns pulse, a resonance peak shift of 0.34 nm and an extinction ratio over 14 dB were realized.

Here, we systematically characterized the effect of amplitude and duration of voltage pulses on the multilevel switching response of photonic memories. The [Formula omitted. See PDF] patch was gradually amorphized and generated 38 levels in the PM by applying an electric pulse with a duration of 500 ns and voltage amplitudes not exceeding 8.2 V. The transmission change ([Formula omitted. See PDF]) and storage levels are shown in Fig. 3(a). Each optical storage level is the average value of 50 measurements in the same state to avoid test errors due to systematic noise. The lowest resolution of these memory states is 0.07 dB. Among them, 28 levels were distinguishable after the transmission change is converted to the linear region, which can be used for info storage for optical computing. As our simulations confirmed, prolonging the pulse width can reduce the driving voltage for the melt quenching of [Formula omitted. See PDF] during amorphization [see Fig. 3(b)]. By employing a pulse duration of [Formula omitted. See PDF], the driving voltage needed for partial amorphization of [Formula omitted. See PDF] to generate a transmittance change could be reduced to 5.3 V. The device would be damaged once the pulse duration of the relatively high-voltage amorphization pulse exceeded [Formula omitted. See PDF]; hence, the pulse duration should be kept within [Formula omitted. See PDF]. The amorphization driving voltage could be reduced to 4.4 V by narrowing the gap between the waveguide and the metal contact (see Sec. S4 in the Supplementary Material), suggesting good scaling potential with improved energy efficiency.

As for multilevel crystallization, by applying fixed voltage amplitude at 3 V and various pulse durations of no more than [Formula omitted. See PDF], 40 memory states were demonstrated with a resolution higher than 0.07 dB, as shown in Fig. 3(c). After conversion to the linear domain, there are still 34 different states (more than 5 bits). Each level was also averaged by 50 measurements. The standard deviation in Fig. 3(c) confirms that the states are separable even with noise in the measurement system. The write speed of the PM could be further improved by increasing the driving voltage for crystallization, as shown in Fig. 3(d), consistent with our design.

Hence, a 5-bit PCM-integrated PM was demonstrated, with a driving voltage lower than 10 V and a switching time within tens of microseconds. The experimental driving voltage is not as low as the simulated one, which may result from nonideal ion implantation and activation in the device fabrication.

4. Volatile Modulation-Compatible Photonic Memory for ONNs

A photonic neural network with PCM-integrated memory is of zero static power consumption, but in situ training via continually and intensively switching the phase of PCM is neither energy-efficient nor fast enough. Here, we address this issue by embedding a volatile modulation function into nonvolatile PM. Figure 4(a) shows the change of normalized transmittance spectra of the PM during volatile modulation used in the in situ training process. Note that the [Formula omitted. See PDF] patch on the PIN diode is now amorphized. The ripples of the measured spectra resulted from the Fabry–Perot resonance due to the reflection of the grating coupler. A peak shift efficiency of [Formula omitted. See PDF] was realized. Figure 4(b) shows the dynamic response of the PM when a 1.3 V, 1 MHz square-wave signal was applied. The 10%-to-90% rising time ([Formula omitted. See PDF]) and 90%-to-10% falling time ([Formula omitted. See PDF]) are characterized to be 13.4 and 23.0 ns, respectively, corresponding to a 3 dB bandwidth of 15.2 MHz.

Here, we simulated electrically programmable ONNs by Python exemplified by a [Formula omitted. See PDF] optical convolution kernel (OCK) constructed by the PM, as shown in Fig. 5(a). Since the PCM-integrated PM was demonstrated in the form of an MRR, the convolution operation was implemented through a WDM scheme. Modulated optical signals with four different wavelengths were equally sent to the OCK in four equal channels. After the optical convolution operation, optical signals were converted to electrical signals, amplified by the transimpedance amplifier, and then processed by the CPU. Any intermediate storage state could be configured from an unknown state by employing two electrical pulses (one for amorphization and the other for crystallization), and the measured transmission change ([Formula omitted. See PDF]) is shown in the inset of Fig. 5(a). Thus, our proposed OCK is capable of both fast on-chip training and computing with near-zero power consumption.

The PM-embedded OCK was theoretically verified by the MNIST handwritten digit database. Before the on-chip training of OCK execution, the states of all SbSe patches are initialized to their crystalline state. After that, the on-chip training of OCK was implemented by exploiting the volatile modulation of our PM. Then, the trained weights were written to the PM by applying a reset (amorphization) pulse followed by a fractional-crystallization pulse after the on-chip training of OCK. Figure 5(b) shows a schematic diagram of the evolution of measured transmittance spectra and kernel value. The trained and stored MRR arrays have different transmittance spectra, since the on-chip training of OCK and writing were conducted through different principles and approaches. Yet the value of weights after the on-chip training of OCK and writing should be as close as possible (and ideally the same). The question naturally arises over whether the discrete storage states of PCM-based PMs may lead to performance deterioration of the OCK. To verify this, the accuracy of predictions after the simulation of the on-chip training of OCK via PIN diodes ([Formula omitted. See PDF]) is shown in Fig. 5(c). After the trained parameters were written into the PMs, the implementation of the network reached minimal deviation in accuracy, as shown in Fig. 5(d). Note that the scale of the MRR array could be easily expanded. Considering there are [Formula omitted. See PDF] channels for data processing, the OCK could be scaled up to [Formula omitted. See PDF] by simply decreasing the radius of the [Formula omitted. See PDF] MRR to [Formula omitted. See PDF] in theory (see Sec. S5 in the Supplementary Material).

The PM-based convolution core benefits both on-chip training of OCK and low-static power computing. The on-chip training of OCK based on the volatile-compatible PM provides a training speed typically 1000 times faster than the commonly used TO scheme.²⁰ After on-chip training of OCK, the computing is done passively without static power consumption. With this scheme, the saved power consumption of an [Formula omitted. See PDF] OCK is [Formula omitted. See PDF], compared with the typical TO modulator array with 10 mW of each discrete device on average.²⁰^,⁵⁷ Therefore, the ONNs with PM are attractive in sporadic programming applications, and the power efficiency would increase with the scaling up of PICs.

In large-scale ONNs where PMs are expected to be used in the whole linear network, multibit storage of PMs can play a significant role. For instance, constructing an ONN (with a [Formula omitted. See PDF] OCK) from PMs where the in situ training results showed an averaged prediction accuracy rate of 94.64% identifying the MNIST data set, PMs need at least 4 bits to achieve comparable prediction accuracy (averaged accuracy rate [Formula omitted. See PDF]), as shown in Sec. S6 in the Supplementary Material. This indicates that multibit PMs are necessary for high-performance ONNs, and higher bits are expected for more complicated applications.

5. Conclusion

In this work, we proposed an electrically programmable phase-change PM for energy-efficient in situ training ONNs with CMOS compatibility and scalability. By integrating an [Formula omitted. See PDF] phase-change patch onto a PIN diode, we designed and experimentally validated the PCM-driven 5-bit PM using an MRR. The PM exhibits a transmittance contrast of [Formula omitted. See PDF], creating 28/34 storage levels during amorphization/crystallization, and the corresponding pulse voltages (pulse durations) are 7.4 to 8.2 V ([Formula omitted. See PDF])/3 V (10 to [Formula omitted. See PDF]). Furthermore, theoretically, complete amorphization of [Formula omitted. See PDF] can be induced by a 500-ns electrical pulse with an actuation voltage as low as 3.3 V, which can be provided by an integrated circuit with standard CMOS technology. In our experiment, fractional amorphization was achieved by applying a [Formula omitted. See PDF] voltage pulse. Volatile modulation with a bandwidth of [Formula omitted. See PDF] was also achieved in this PM when electric pulses with voltages lower than 2 V were applied, enabling a 1000 times faster training in theory for nonvolatile ONNs composed of such PMs than the commonly used TO switches. After training, PMs are configured to specific states via PIN-microheater-assisted multilevel switching (i.e., partial phase transition) of [Formula omitted. See PDF] to match the target weight values in the ONNs. According to our simulations, at least 4 bits are needed for PMs to maintain the accuracy of predictions of ONNs after the simulated in situ training when tested by the MNIST handwritten data set. This study on volatile modulation-compatible PM provides a feasible solution for constructing nonvolatile ONNs with high-speed and energy-efficient on-chip training capability.

Acknowledgments

This work was supported by the National Key Research and Development Program of China (2019YFB2203002 and 2021YFB2801300), National Natural Science Foundation of China (62105287, 91950204, and 61975179), and Zhejiang Provincial Natural Science Foundation (LD22F040002). The authors would like to acknowledge the fabrication support from the Institute of Microelectronics of the Chinese Academy of Sciences, ZJU Micro-Nano Fabrication Center at Zhejiang University, and Westlake Center for Micro/Nano Fabrication at Westlake University. The authors would also like to thank Qing Zhao and Liming Shan for their help in thin-film depositions and Xingjie Li for his help in developing the test program. The authors declare no conflicts of interest.

Maoliang Wei is a doctoral candidate student of Professor Hongtao Lin at Zhejiang University. He received his BS degree in electronic information science and technology from Xiamen University. His current research focuses on the study of micronano devices and systems.

Junying Li is an associate professor at the College of Information Science and Electronic Engineering, Zhejiang University, Hangzhou, Zhejiang, China. She received her BS and PhD degrees from Chongqing University, Chongqing, China. Her research is focused on chalcogenide phase-change materials, reconfigurable photonic devices, and their applications.

Hongtao Lin is a ZJU100 Young Professor at the College of Information Science and Electronic Engineering, Zhejiang University, Hangzhou, Zhejiang, China. He received his BS degree from the University of Science and Technology of China, Hefei, China, and his PhD from the University of Delaware, Newark, Delaware, USA. His research interests are focused on chalcogenide integrated nanophotonics and their applications.

Biographies of the other authors are not available.

References

1. C. Zhang, P. Patras and H. Haddadi, “Deep learning in mobile and wireless networking: a survey,” IEEE Commun. Surv. Tutor., 21 (3), 2224 –2287 https://doi.org/10.1109/COMST.2019.2904897 (2019).

2. L. Chen et al., “Deep neural network based vehicle and pedestrian detection for autonomous driving: a survey,” IEEE Trans. Intell. Transp. Syst., 22 (6), 3234 –3246 https://doi.org/10.1109/TITS.2020.2993926 (2021).

3. J. Chai et al., “Deep learning in computer vision: a critical review of emerging techniques and application scenarios,” Mach. Learn. Appl., 6 100134 https://doi.org/10.1016/j.mlwa.2021.100134 (2021).

4. P. Xu and Z. Zhou, “Silicon-based optoelectronics for general-purpose matrix computation: a review,” Adv. Photonics, 4 (4), 044001 https://doi.org/10.1117/1.AP.4.4.044001 (2022).

5. H. Zhou et al., “Photonic matrix multiplication lights up photonic accelerator and beyond,” Light Sci. Appl., 11 (1), 30 https://doi.org/10.1038/s41377-022-00717-8 (2022).

6. B. J. Shastri et al., “Photonics for artificial intelligence and neuromorphic computing,” Nat. Photonics, 15 (2), 102 –114 https://doi.org/10.1038/s41566-020-00754-y NPAHBY 1749-4885 (2021).

7. C. Li et al., “The challenges of modern computing and new opportunities for optics,” PhotoniX, 2 (1), 20 https://doi.org/10.1186/s43074-021-00042-0 (2021).

8. J. Liu et al., “Research progress in optical neural networks: theory, applications and developments,” PhotoniX, 2 (1), 5 https://doi.org/10.1186/s43074-021-00026-0 (2021).

9. X. Xu et al., “11 TOPS photonic convolutional accelerator for optical neural networks,” Nature, 589 (7840), 44 –51 https://doi.org/10.1038/s41586-020-03063-0 (2021).

10. T. J. Seok et al., “Large-scale broadband digital silicon photonic switches with vertical adiabatic couplers,” Optica, 3 (1), 64 –70 https://doi.org/10.1364/OPTICA.3.000064 (2016).

11. W. Bogaerts et al., “Programmable photonic circuits,” Nature, 586 (7828), 207 –216 https://doi.org/10.1038/s41586-020-2764-0 (2020).

12. S. Y. Siew et al., “Review of silicon photonics technology and platform development,” J. Lightwave Technol., 39 (13), 4374 –4389 https://doi.org/10.1109/JLT.2021.3066203 JLTEDG 0733-8724 (2021).

13. H. Shu et al., “Microcomb-driven silicon photonic systems,” Nature, 605 (7910), 457 –463 https://doi.org/10.1038/s41586-022-04579-3 (2022).

14. S. Bandyopadhyay et al., “Single chip photonic deep neural network with accelerated training,” (2022).

15. S. Pai et al., “Experimentally realized in situ backpropagation for deep learning in photonic neural networks,” Science, 380 398 –404 https://doi.org/10.1126/science.ade8450 (2023).

16. H. Zhang et al., “An optical neural chip for implementing complex-valued neural network,” Nat. Commun., 12 (1), 457 https://doi.org/10.1038/s41467-020-20719-7 NCAOBW 2041-1723 (2021).

17. J. Y. S. Tan et al., “Monadic Pavlovian associative learning in a backpropagation-free photonic network,” Optica, 9 (7), 792 –802 https://doi.org/10.1364/OPTICA.455864 (2022).

18. J. Feldmann et al., “Parallel convolutional processing using an integrated photonic tensor core,” Nature, 589 (7840), 52 –58 https://doi.org/10.1038/s41586-020-03070-1 (2021).

19. F. Ashtiani, A. J. Geers and F. Aflatouni, “An on-chip photonic deep neural network for image classification,” Nature, 606 (7914), 501 –506 https://doi.org/10.1038/s41586-022-04714-0 (2022).

20. Y. Shen et al., “Deep learning with coherent nanophotonic circuits,” Nat. Photonics, 11 (7), 441 –446 https://doi.org/10.1038/nphoton.2017.93 NPAHBY 1749-4885 (2017).

21. Z. G. Cheng et al., “On-chip photonic synapse,” Sci. Adv., 3 (9), e1700160 https://doi.org/10.1126/sciadv.1700160 (2017).

22. P. Edinger et al., “Silicon photonic microelectromechanical phase shifters for scalable programmable photonics,” Opt. Lett., 46 (22), 5671 –5674 https://doi.org/10.1364/OL.436288 OPLEDP 0146-9592 (2021).

23. D. Pérez, I. Gasulla and J. Capmany, “Programmable multifunctional integrated nanophotonics,” Nanophotonics, 7 (8), 1351 –1371 https://doi.org/10.1515/nanoph-2018-0051 (2018).

24. K. Shportko et al., “Resonant bonding in crystalline phase-change materials,” Nat. Mater., 7 (8), 653 –658 https://doi.org/10.1038/nmat2226 NMAACR 1476-1122 (2008).

25. A.-K. U. Michel et al., “Using low-loss phase-change materials for mid-infrared antenna resonance tuning,” Nano Lett., 13 (8), 3470 –3475 https://doi.org/10.1021/nl4006194 NALEFD 1530-6984 (2013).

26. L. Mao et al., “Reversible switching of electromagnetically induced transparency in phase change metasurfaces,” Adv. Photonics, 2 (5), 056004 https://doi.org/10.1117/1.AP.2.5.056004 (2020).

27. M. Wuttig and N. Yamada, “Phase-change materials for rewriteable data storage,” Nat. Mater., 6 (11), 824 –832 https://doi.org/10.1038/nmat2009 NMAACR 1476-1122 (2007).

28. J.-F. Song et al., “Integrated photonics with programmable non-volatile memory,” Sci. Rep., 6 (1), 22616 https://doi.org/10.1038/srep22616 (2016).

29. J. Geler-Kremer et al., “A ferroelectric multilevel non-volatile photonic phase shifter,” Nat. Photonics, 16 (7), 491 –497 https://doi.org/10.1038/s41566-022-01003-0 NPAHBY 1749-4885 (2022).

30. S. Abe and K. Hane, “A silicon microring resonator with a nanolatch mechanism,” Microsyst. Technol., 21 (9), 2019 –2024 https://doi.org/10.1007/s00542-014-2283-8 0946-7076 (2015).

31. J. Zheng et al., “Nonvolatile electrically reconfigurable integrated photonic switch enabled by a silicon PIN diode heater,” Adv. Mater., 32 (31), 2001218 https://doi.org/10.1002/adma.202001218 ADVMEW 0935-9648 (2020).

32. C. Ríos et al., “In-memory computing on a photonic platform,” Sci. Adv., 5 (2), eaau5759 https://doi.org/10.1126/sciadv.aau5759 (2019).

33. D. Wu et al., “Resonant multilevel optical switching with phase change material GST,” Nanophotonics, 11 (15), 3437 –3446 https://doi.org/10.1515/nanoph-2022-0276 (2022).

34. N. Farmakidis et al., “Electronically reconfigurable photonic switches incorporating plasmonic structures and phase change materials,” Adv. Sci., 9 (20), 2200383 https://doi.org/10.1002/advs.202200383 1936-6612 (2022).

35. C. Wu et al., “Low-loss integrated photonic switch using subwavelength patterned phase change material,” ACS Photonics, 6 (1), 87 –92 https://doi.org/10.1021/acsphotonics.8b01516 (2019).

36. H. Zhang et al., “Miniature multilevel optical memristive switch using phase change material,” ACS Photonics, 6 (9), 2205 –2212 https://doi.org/10.1021/acsphotonics.9b00819 (2019).

37. Y. Zhang et al., “Broadband transparent optical phase change materials for high-performance nonvolatile photonics,” Nat. Commun., 10 (1), 4279 https://doi.org/10.1038/s41467-019-12196-4 NCAOBW 2041-1723 (2019).

38. Z. Fang et al., “Non-volatile reconfigurable integrated photonics enabled by broadband low-loss phase change material,” Adv. Opt. Mater., 9 (9), 2002049 https://doi.org/10.1002/adom.202002049 2195-1071 (2021).

39. Z. Fang et al., “Ultra-low-energy programmable non-volatile silicon photonics based on phase-change materials with graphene heaters,” Nat. Nanotechnol., 17 (8), 842 –848 https://doi.org/10.1038/s41565-022-01153-w NNAABX 1748-3387 (2022).

40. C. Ríos et al., “Ultra-compact nonvolatile phase shifter based on electrically reprogrammable transparent phase change materials,” PhotoniX, 3 (1), 26 https://doi.org/10.1186/s43074-022-00070-4 (2022).

41. J. Feldmann et al., “All-optical spiking neurosynaptic networks with self-learning capabilities,” Nature, 569 (7755), 208 –214 https://doi.org/10.1038/s41586-019-1157-8 (2019).

42. X. Ma et al., “Photonic tensor core with photonic compute-in-memory,” in Opt. Fiber Commun. Conf. and Exhibit. (OFC), 1 –3 (2022).

43. T. W. Hughes et al., “Training of photonic neural networks through in situ backpropagation and gradient measurement,” Optica, 5 (7), 864 –871 https://doi.org/10.1364/OPTICA.5.000864 (2018).

44. H. Zhou et al., “All-in-one silicon photonic polarization processor,” Nanophotonics, 8 (12), 2257 –2267 https://doi.org/10.1515/nanoph-2019-0310 (2019).

45. H. Zhou et al., “Chip-scale optical matrix computation for PageRank algorithm,” IEEE J. Sel. Top. Quantum Electron., 26 (2), 8300910 https://doi.org/10.1109/JSTQE.2019.2943347 IJSQEN 1077-260X (2020).

46. S. M. Buckley et al., “Photonic online learning: a perspective,” Nanophotonics, 12 833 –845 https://doi.org/10.1515/nanoph-2022-0553 (2023).

47. S. Xu, J. Wang and W. Zou, “Optical convolutional neural network with WDM-based optical patching and microring weighting banks,” IEEE Photonics Technol. Lett., 33 (2), 89 –92 https://doi.org/10.1109/LPT.2020.3045478 IPTLEL 1041-1135 (2021).

48. F. Brückerhoff-Plückelmann et al., “Broadband photonic tensor core with integrated ultra-low crosstalk wavelength multiplexers,” Nanophotonics, 11 (17), 4063 –4072 https://doi.org/10.1515/nanoph-2021-0752 (2022).

49. B. Bai et al., “Microcomb-based integrated photonic processing unit,” Nat. Commun., 14 (1), 66 https://doi.org/10.1038/s41467-022-35506-9 NCAOBW 2041-1723 (2023).

50. R. Soref, “Mid-infrared photonics in silicon and germanium,” Nat. Photonics, 4 (8), 495 –497 https://doi.org/10.1038/nphoton.2010.171 NPAHBY 1749-4885 (2010).

51. M. Nedeljkovic, R. Soref and G. Z. Mashanovich, “Free-carrier electrorefraction and electroabsorption modulation predictions for silicon over the 1–14 μm infrared wavelength range,” IEEE Photonics J., 3 (6), 1171 –1180 https://doi.org/10.1109/JPHOT.2011.2171930 (2011).

52. K. Lei et al., “Magnetron-sputtered and thermal-evaporated low-loss Sb-Se phase-change films in non-volatile integrated photonics,” Opt. Mater. Express, 12 (7), 2815 –2823 https://doi.org/10.1364/OME.462426 (2022).

53. Y. Zhang et al., “Myths and truths about optical phase change materials: a perspective,” Appl. Phys. Lett., 118 (21), 210501 https://doi.org/10.1063/5.0054114 APPLAB 0003-6951 (2021).

54. H. Ballan and M. Declercq, High Voltage Devices and Circuits in Standard CMOS Technologies, 268 –269 Springer Science & Business Media, Boston (2013).

55. C. Zhong et al., “Fast thermo-optical modulators with doped-silicon heaters operating at 2 μm ,” Opt. Express, 29 (15), 23508 –23516 https://doi.org/10.1364/OE.430756 OPEXFF 1094-4087 (2021).

56. M. Wei et al., “TDFA-band silicon optical variable attenuator,” Prog. Electromagn. Res., 174 33 –42 https://doi.org/10.2528/PIER22011302 PELREX 1043-626X (2022).

57. S. Liu et al., “Thermo-optic phase shifters based on silicon-on-insulator platform: state-of-the-art and a review,” Front. Optoelectron., 15 (1), 9 https://doi.org/10.1007/s12200-022-00012-9 (2022).

58. J. R. Erickson et al., “Designing fast and efficient electrically driven phase change photonics using foundry compatible waveguide-integrated microheaters,” Opt. Express, 30 (8), 13673 –13689 https://doi.org/10.1364/OE.446984 OPEXFF 1094-4087 (2022).

59. H. Ma et al., “Passive devices at 2 μm wavelength on 200 mm CMOS-compatible silicon photonics platform [Invited],” Chin. Opt. Lett., 19 (7), 071301 https://doi.org/10.3788/COL202119.071301 CJOEE3 1671-7694 (2021).

AuthorAffiliation

Maoliang Wei,^1,* Junying Li,^1,* Zequn Chen,² Bo Tang,³ Zhiqi Jia,¹ Peng Zhang,³ Kunhao Lei,¹ Kai Xu,¹ Jianghong Wu,² Chuyu Zhong,¹ Hui Ma,¹ Yuting Ye,² Jialing Jian,² Chunlei Sun,² Ruonan Liu,³ Ying Sun,¹ Wei. E. I. Shahttps://orcid.org/0000-0002-7431-8121,¹ Xiaoyong Hu,⁴ Jianyi Yang,¹ Lan Lihttps://orcid.org/0000-0002-9097-9157,² Hongtao Lin^1,**
¹Zhejiang Univ. (China)
²Westlake Univ. (China)
³Institute of Microelectronics (China)
⁴Peking Univ. (China)
^*These authors contributed equally to this work.
^**Address all correspondence to Hongtao Lin, [email protected]

Word count: 5138

Show less

© 2023. This work is licensed under https://creativecommons.org/licenses/by/4.0/ (the “License”). Notwithstanding the ProQuest Terms and Conditions, you may use this content in accordance with the terms of the License.

Abstract

Translate

Optical neural networks (ONNs), enabling low latency and high parallel data processing without electromagnetic interference, have become a viable player for fast and energy-efficient processing and calculation to meet the increasing demand for hash rate. Photonic memories employing nonvolatile phase-change materials could achieve zero static power consumption, low thermal cross talk, large-scale, and high-energy-efficient photonic neural networks. Nevertheless, the switching speed and dynamic energy consumption of phase-change material-based photonic memories make them inapplicable for in situ training. Here, by integrating a patch of phase change thin film with a PIN-diode-embedded microring resonator, a bifunctional photonic memory enabling both 5-bit storage and nanoseconds volatile modulation was demonstrated. For the first time, a concept is presented for electrically programmable phase-change material-driven photonic memory integrated with nanosecond modulation to allow fast in situ training and zero static power consumption data processing in ONNs. ONNs with an optical convolution kernel constructed by our photonic memory theoretically achieved an accuracy of predictions higher than 95% when tested by the MNIST handwritten digit database. This provides a feasible solution to constructing large-scale nonvolatile ONNs with high-speed in situ training capability.

Details

Title

Electrically programmable phase-change photonic memory for optical neural networks with nanoseconds in situ training capability

Author

Maoliang Wei; Li, Junying; Chen, Zequn; Tang, Bo; Jia, Zhiqi; Zhang, Peng; Kunhao Lei; Xu, Kai; Wu, Jianghong; Zhong, Chuyu; Ma, Hui; Ye, Yuting; Jian, Jialing; Sun, Chunlei; Liu, Ruonan; Sun, Ying; Sha, Wei E I; Hu, Xiaoyong; Yang, Jianyi; Li, Lan; Lin, Hongtao

First page

46004

Section

Research Articles

Publication year

2023

Publication date

Jul 2023

Publisher

S P I E - International Society for

ISSN

25775421

Source type

Scholarly Journal

Language of publication

English

DOI

https://doi.org/10.1117/1.AP.5.4.046004

ProQuest document ID

2862388950

Electrically programmable phase-change photonic memory for optical neural networks with nanoseconds in situ training capability

Jump to:

Full text

Abstract

Details

Suggested sources