Content area
Single-pixel imaging (SPI) is a promising technology for optical imaging beyond the visible spectrum, where commercial cameras are expensive or unavailable. However, limitations such as slow pattern projection rates and time-consuming reconstruction algorithms hinder its throughput for real-time imaging. Consequently, conventional SPI is inadequate for high-speed, high-resolution tasks. To address these challenges, we developed an ultrahigh-throughput single-pixel complex-field microscopy (SPCM) system utilizing frequency-comb acousto-optic coherent encoding (FACE). This system enables real-time complex-field monitoring in the non-visible domain. Operating at 1030 nm, our system achieves a record-high space-bandwidth-time product (SBP-T) of 1.3 × 107, surpassing previous SPCM (~104), SPI (~105), and even certain types of commercial near-infrared cameras (~106). It supports real-time streaming at 1000 Hz with a frame size of 80 × 81 pixels and a lateral resolution of 3.76 μm across an approximately 300 μm field of view. We validated the system by imaging dynamic transparent scenes, including microfluidics, live microorganisms, chemical reactions, as well as imaging through scattering media. This advancement offers a superior solution for high-speed, high-resolution complex-field imaging beyond the visible spectrum, significantly enhancing SPI performance across various applications.
Given the ultrafast spatiotemporal nature of frequency-comb acousto-optic coherent encoding, ultrahigh-throughput single-pixel complex-field microscopy with a record-high space-bandwidth-time product enables real-time monitoring over various transparent dynamic scenarios in special spectrums.
Introduction
Optical imaging in the visible spectrum has advanced rapidly in recent years, largely due to mature silicon-based semiconductor fabrication technologies that enable high-performance cameras. However, for spectra beyond the visible range—such as near-infrared (NIR), X-ray, and terahertz—the development of pixel array detectors is still in its early stages. As a result, cameras operating in these non-visible spectra are often inefficient or prohibitively expensive, presenting significant challenges and costs for imaging in these domains. Despite these limitations, imaging in non-visible spectra is essential because it provides information that is often inaccessible with visible light, offering deeper insights into the structural and functional properties of various targets. For example, NIR imaging optimizes penetration depth and resolution while being sensitive to endogenous biological contrast agents like fat or protein1,2. This sensitivity enables the detection of subtle phenomena across biological, material, and chemical fields, driving advances in medical diagnostics, materials science, and chemical analysis3, 4–5.
Single-pixel detectors provide superior performance across a broad spectral range compared to pixel-array detectors, driving advancements in optical imaging beyond the visible spectrum through an emerging computational technique known as single-pixel imaging (SPI)6, 7, 8, 9–10. SPI utilizes the high detection capabilities of single-pixel detectors by combining active patterned illumination with computational image reconstruction11, 12, 13, 14, 15, 16, 17, 18–19, making it a promising alternative for imaging in challenging spectral regimes such as mid-infrared20,21, terahertz22, 23–24, and even photoacoustic waves25,26. Furthermore, SPI has been demonstrated to be extended to single-pixel complex-field microscopy (SPCM) within a holographic framework27, 28–29, enabling the retrieval of both phase and amplitude information. This capability is particularly useful for observing dynamic phenomena in the NIR range, such as live cells, swimming microorganisms, and chemical reactions, which are difficult to differentiate using intensity-only images due to the relatively uniform absorption of biological samples and solutions30. However, SPI and SPCM systems typically have lower information throughput than pixel-array camera systems because they rely on a single detector for data collection, making real-time monitoring of rapidly changing dynamic scenes challenging. The throughput of these systems is determined by two key factors: frames per second (FPS) and frame size. The interplay between these factors necessitates a more comprehensive metric: the space-bandwidth-time product (SBP-T), which quantifies the amount of information collected per unit time by multiplying FPS and frame size31,32.
The primary limitation of the SBP-T in SPI/SPCM systems arises from slow pattern projection rates. While fast digital micromirror devices (DMDs) with projection rates up to 23 kHz have allowed SPI/SPCM systems to achieve SBP-T values between 103 and 104 29,33, 34, 35–36, further advancements are needed. Incorporating heterodyne holography has enabled a DMD-based SPCM system to achieve a fourfold increase in SBP-T, reaching 4.6 × 10437. To further enhance SBP-T, overcoming the projection rate limitations of DMDs is essential. Researchers have developed methods to achieve higher pattern projection rates. One approach involves using a rotational mask pattern, which raises the projection rate to 2.4 MHz38. More advanced techniques have pushed projection rates to 14.1 MHz by combining laser-scanning hardware with a DMD39. However, high-speed mechanical scanning introduces instability, preventing these systems from consistently operating at their maximum projection rate limits. Moreover, higher projection rates increase the computational burden for image reconstruction, limiting FPS and reducing information throughput during real-time monitoring. For example, an SPI system using a rotational mask pattern achieves real-time imaging at 72 FPS with a frame size of 101 × 103 pixels, resulting in an SBP-T of 7.5 × 10538. Another SPI system that combines laser-scanning hardware with a DMD achieves 51 FPS with a frame size of 59 × 61 pixels, yielding an SBP-T of approximately 1.8 × 105 39. Despite these advancements, the SBP-T of SPI/SPCM systems remains several orders of magnitude lower than that of a commercial InGaAs camera operating in the NIR spectrum (e.g., ARTCAM-008TNIR, ARTRAY), which achieves an SBP-T of 7.4 × 106 with a frame size of 320 × 256 and an FPS of 90 Hz.
To achieve SPI/SPCM throughput comparable to, or surpassing, that of commercial cameras in specialized spectra such as the NIR spectrum, it is crucial to develop innovative fast pattern projection schemes combined with efficient image reconstruction algorithms for streamlined real-time imaging. In this study, we present an ultrahigh-throughput SPCM system using frequency-comb acousto-optic coherent encoding (FACE), which outperforms commercial cameras in the NIR spectral regime. The FACE scheme achieves remarkable efficiency by integrating advancements in motionless fast pattern refresh with effective complex-field imaging reconstruction. For pattern refresh, the FACE scheme modulates projection patterns into two-dimensional frequency combs. The asynchronous oscillation of different tones within these combs allows the patterns to evolve rapidly over time, ensuring fast and stable pattern refresh without mechanical motion. For complex-field imaging reconstruction, the FACE scheme uses fast Fourier transformation (FFT) to accurately decode the spatial information of each tone, enabling efficient reconstruction without requiring extensive computational resources. Furthermore, the scheme maintains a zero-order unmodulated component as a coaxial reference, allowing for stable complex-field retrieval within a highly efficient common-path heterodyne holography setup.
The developed FACE-SPCM system enables real-time monitoring with a frame size of 80 × 81 at 1000 Hz, supporting complex-field microscopy with a lateral resolution of 3.76 μm and a field of view (FoV) of approximately 300 μm. Data acquisition and image reconstruction are performed efficiently, achieving a record-high SBP-T of up to 1.3 × 107 in the NIR spectral regime. A comprehensive comparison of representative aforementioned traditional SPI/SPCM systems, along with a broader category of imaging systems, highlighting key imaging parameters, is provided in Supplementary Note 1. The SBP-T (~107) achieved in this study is two to three orders of magnitude higher than previous records for SPCM (~104) and SPI (~105), and even surpasses those of certain commercial NIR cameras (~106). A series of imaging experiments are conducted for various dynamic transparent scenes even through scattering media, including high-speed microfluidics, live microorganisms, and chemical reactions, demonstrating the capability of real-time diagnosis of physicochemical phenomena. This development paves the way for future imaging systems based on single-pixel detectors, establishing the FACE scheme as a leading solution for high-speed, high-resolution complex-field imaging.
Results
Operational principle of FACE-SPCM
The foundational concept for achieving ultrahigh-throughput SPCM is to use frequency components to encode spatial positions. This approach is similarly applied in various imaging fields that use different types of spectral encodings, such as time-stretch techniques in ultrafast optics and acousto-optic modulation for intensity oscillation scanning40, 41, 42, 43, 44–45. These methods, which primarily function within one-dimensional (1D) intensity scanning frameworks, have been tailored for specific applications like scanning-based flow cytometry and cell sorting46, 47–48. Historically, a virtually imaged phased array combined with a grating has been used to create a two-dimensional (2D) mapping between frequency components and spatial positions, although this configuration has been limited by frame size40,49.
In this study, we extend the concept of spectral-encoded imaging to a more general 2D imaging scenario in complex fields by developing the FACE-SPCM system. The operational principle is illustrated in Fig. 1. This spectral-encoded projection scheme eliminates the need for time-consuming sequential projection of different illumination patterns by generating a pattern that evolves naturally over time. The key to extending spectral-spatial mapping from 1D to 2D with spectral-encoded projection is the use of a pair of cascaded acousto-optic deflectors (AODs) oriented in orthogonal directions. Each AOD is driven by different 1D frequency-comb electronic signals over the same modulation bandwidth of , with N and M tones, respectively, and varied spacing between comb teeth of and . Precise alignment of these AODs using a 4f system with identical or cylindrical lenses, as detailed in Supplementary Note 2, is crucial. To effectively demonstrate the FACE scheme for subsequent applications, the polarization conditions involved in spectral-encoded projection are detailed in Supplementary Note 3. Under these conditions, the FACE scheme enables a one-to-one mapping between spatial positions and frequency components, forming the basis of the ultrahigh-throughput SPCM system. A similar strategy has been employed for real-time wavefront shaping through scattering media using acousto-optic modulators with few degrees of freedom50,51.
Fig. 1 Schematic diagram of the FACE-SPCM system. [Images not available. See PDF.]
The 2D spectral-encoded projection is generated by a pair of orthogonal optical frequency combs through acousto-optic modulation. Complex-field details of the scene are encoded within a temporal sequence using a coaxial heterodyne holographic setup. This configuration enables the efficient retrieval of complex-field information in various dynamic scenarios, such as microfluidics, live tissue, and chemical reactions. AODx (AODy): acousto-optic deflector in the x (y) direction; SM scattering medium, PD single-pixel photodetector
Mathematically, the correspondence between spatial positions and frequencies in the FACE scheme is defined by:
1
where , with . Here, and represent the relative lateral positions in orthogonal directions, while is the base frequency of light at the reference point. and are the relative distances resulting from 2D Bragg diffraction along orthogonal directions. A detailed explanation for designing this FACE framework in coaxial heterodyne holography without overlap is provided in Supplementary Note 4.As a 2D FACE pattern, the spectral-encoded projection interacts with the target object to achieve complex-field microscopy. For simplicity, while maintaining generality, we represent the complex-field information of the object as , where and denote the 2D amplitude and phase distributions, respectively. After interaction, the resultant light field is described by:
2
Here, we assume each frequency component in the FACE pattern has equal amplitudes and phases. However, in practice, factors such as non-uniform circuit amplification, varying diffraction efficiencies, and differences in optical path lengths can cause significant variations in amplitude and phase. Nevertheless, in a linear imaging system, these effects can be compensated for through a null calibration process, as detailed in Supplementary Note 5.To retrieve the phase information of the object, the FACE scheme is designed to maintain an unmodulated component with a frequency as a coaxial reference using a convergence grating, offering compact and robust complex-field imaging without external disturbances. To achieve high spatial coherence in coaxial interference, the optimal parameters for the convergence grating are detailed in Supplementary Note 6. In this configuration, the light detected by a single-pixel detector is given by:
3
Here, is the amplitude of the unmodulated component, i.e., the coaxially propagated reference light, which is significantly stronger than the amplitudes of other frequency components. The wide bandwidth of single-pixel detectors, which can reach hundreds of megahertz or even gigahertz, allows for efficient extraction of through FFT within the holographic framework, enabling real-time data processing. A detailed reconstruction of SPCM for dynamic scenes using Fourier analysis is further elaborated in the “Methods” section. The digital flexibility of electronic signal modulation and the choice of physical equipment allow the FACE scheme to support various configurations, adaptable to different dynamic scenarios, as outlined in Supplementary Note 7. To show the superior scalability of FACE-SPCM, further demonstrations adopting alternative imaging schemes and imaging through scattering media are presented in Supplementary Notes 8 and 9.Characterization of imaging resolutions of the FACE-SPCM system
With the operational principle established, we constructed an experimental setup for SPCM integrated with the FACE system, as shown in Fig. 2a. For a comprehensive characterization of the FACE-SPCM system, full details of the experimental setup and its related parameters are provided in the “Methods” section. Rigorous characterization of the laser’s performance, particularly concerning spectral aliasing—critical to the integrity of the FACE-SPCM system—is thoroughly evaluated in Supplementary Note 10.
Fig. 2 Experimental setup and imaging performance of the FACE-SPCM system. [Images not available. See PDF.]
a The optical structure of the FACE-SPCM system is shown from both top and front views, with a resolution test target used for the initial demonstration of the FACE scheme. M1–M2: mirrors; L1–L6: lenses. b Cross-sectional diagram of the resolution test target, with dashed boxes in purple and red indicating the regions selected for imaging. c Reconstructed amplitude images corresponding to the marked regions are presented. The image framed in red (scale bar: 50 μm) shows the finest resolution lines, while images framed in purple (scale bar: 100 μm) demonstrate general imaging performance. Average CNR: 28.77. d, e Expanded 1D profiles from the highlighted area in (c), verifying the resolution capability of the FACE-SPCM system. d Horizontal profile. e Vertical profile. Error bar: standard deviations of five independent realizations
Our experimental demonstration began with a quantitative assessment of the imaging resolution. We used a standard positive resolution target (R3L3S1P, Thorlabs) as a benchmark to evaluate the system’s performance. Resolution is a critical parameter in SPCM systems as it directly impacts the ability to discern fine details. For our demonstration, the system was configured to achieve an effective frame size of 80 × 81, providing a lateral resolution of 3.76 μm across a FoV of approximately 300 μm. This setup enabled the system to resolve intricate structural details of the target, specifically those in group 7-1, which measure 3.91 μm. For simplicity, we focused on amplitude images, as the resolution profiles for both amplitude and phase imaging are identical in our system. To quantitatively assess imaging quality, we employed the contrast-to-noise ratio (CNR) rather than any reference-based metrics to evaluate the performance during practical experiments. This metric is defined as the difference in signal intensity between a target and the background region, divided by the background fluctuations, i.e., noise, in the reconstructed amplitude images. Figure 2b shows a high-definition image of the central portion of the resolution target, with dashed boxes indicating areas selected for detailed observation. The imaging results, presented in Fig. 2c, consist of nine separate images corresponding to predefined positions within Fig. 2b. A detailed analysis of the group 7-1 structure, centered in these images, was performed using 1D amplitude profiles plotted along the horizontal and vertical directions (Fig. 2d, e). These profiles clearly reveal three narrowly spaced dips and a single broader dip, confirming the system’s theoretical resolution capabilities.
FACE-SPCM for microfluidics microscopy through scattering media
We demonstrated the imaging capability of FACE-SPCM for microfluidics microscopy. The microfluidic setup is shown in Supplementary Note 11, where oil-encapsulated droplets are sequentially generated as they merge and flow through a channel in the chip. Details on the preparation of microfluidic chips for droplet formation are provided in sequence. The oil, serving as the carrier medium, dynamically interacted with various solutions, with pure water typically used as the dispersed medium. The interactions were precisely controlled by automated syringe pumps that regulate the flow rates, allowing the exploration of different fluidic behaviors and droplet dynamics. As displayed in Fig. 3a, varying flow rates controlled by automated syringe pumps cause transitions from bubble formation to encapsulated droplets, categorized as fluidic situations 1–3. Corresponding images for respective situations are also provided. However, due to their inability to provide phase information and relatively low FPS, conventional bright-field microscopes (such as the Mshot MSX2) are unable to perform real-time monitoring of high-speed transparent microfluidics, as evidenced by the blurred images shown in Fig. 3b. A microscopy video, slowed down by a factor of eight, is provided in Supplementary Movie S1.
Fig. 3 Experimental demonstration of FACE-SPCM in microfluidics microscopy. [Images not available. See PDF.]
a The microfluidic device is designed for rapidly forming a sequence of flowing droplets through a microchannel. Water is used as the dispersed medium, while oil acts as the carrier medium. By varying the flow rates controlled by an automatic syringe pump, the system transitions through different states are clearly captured by FACE-SPCM, from bubbles to stable encapsulated droplets: (1) Stable bubble formation, (2) Transition from bubbles to encapsulated droplets, and (3) Stable encapsulated droplets. Scale bar: 100 μm. b Intensity microfluidics images obtained by bright-field microscope without phase revelation, captured in real time with a refresh rate of 15 FPS. Scale bar: 100 μm. c Reconstructed complex-field microscopy images of microfluidics captured in real-time FACE-SPCM with an ultrashort temporal interval of 1 ms. Scale bar: 100 μm. Average CNR: 16.55
The FACE-SPCM system, maintaining the high-resolution setting from earlier experiments, achieves complex-field microscopy with an imaging FPS of up to 1000 Hz. The dynamics of droplet formation were meticulously captured in a series of holograms, sampled at regular intervals, with various droplet morphologies showcased in Fig. 3c. This rapid sampling capability enables the observation of subtle flow variations within exceptionally brief time frames, with critical flowing changes highlighted by red dashed arrows. In fluidic situation 1, shown in Fig. 3c, the droplet formation process is depicted with the syringe pump for the carrier medium set at 2.0 mL h−1, while the dispersed medium is set at 0 mL h−1. Under these conditions, stable oil-encapsulated water droplets failed to form, instead resulting in bubbles that indicate unsteady flow dynamics. Although both oil and water are transparent with low absorption, their distinct optical properties—highlighted by their differential refractive indices—reveal the suboptimal conditions for stable droplet formation. In contrast, fluidic situations 2 and 3 in Fig. 3c show more favorable conditions with the syringe pump settings adjusted to 2.5 mL h−1 for the carrier medium and 1.1 mL h−1 for the dispersed medium. These adjustments clearly promote more stable droplet formation, as detailed in the comparative analysis. The corresponding dynamic video results, slowed down by eight times, are provided in Supplementary Movie S2. For a detailed exploration of dynamic imaging across different modalities, see Supplementary Notes 7 and 8. To verify the superior robustness against challenges, the FACE-SPCM system enables microfluidics microscopy through scattering media while consistently maintaining its FoV and resolution. The imaging results are provided in Supplementary Note 9, while dynamic video slowed down by eight times in Supplementary Movie S3, suggesting the potential applications of SPCM in biomedical and environmental sciences. It is important to note that in this study, the scattering medium is placed in front of the detector, following the same imaging configuration as previous works52,53. A more realistic non-invasive imaging scenario, where scattering occurs both before and after the target scene, remains unachievable using current SPI techniques.
To enhance the technical demonstration for biological applications, we also employed the FACE-SPCM system to image a group of paramecia for microorganism observation, with experimental results detailed in Supplementary Note 12. A corresponding video, slowed down by eight times, is included in Supplementary Movie S4. The preparation process is further elaborated in the “Methods” section.
FACE-SPCM for real-time imaging of chemical reactions and solution mixing
After monitoring the dynamics of microfluidics and live microorganisms, the FACE-SPCM system demonstrated its potential for real-time analysis of chemical reactions in transparent solutions, leveraging robust phase information retrieval and high-speed imaging capabilities. Unlike biological samples containing fats and proteins that show slight absorptive contrast due to NIR specialization, uniform water-based solutions pose an additional challenge in visualizing transparent details. To demonstrate the system’s capability to discern chemical interactions, we designed two experimental schemes: Scheme 1 for a chemical reaction and Scheme 2 for a simple physical mixture. Both schemes involve two transparent solutions, labeled Solution 1 and Solution 2. As shown in Fig. 4a, the microfluidic chip was specifically designed to efficiently mix the two chemical reagents: Solution 1 enters through the side inlets and Solution 2 through the bottom inlet. The confluence corner, where the solutions initially meet, is highlighted in a red box, while the downstream channel, where mixing continues, is marked in a blue box. To ensure stable mixing, an automatic syringe pump injected Solution 1 at 2 mL h−1 from the side inlets and Solution 2 at 1.3 mL h−1 from the bottom inlet. A detailed microstructure of this microfluidic chip specifying solution mixing is provided in Supplementary Note 11.
Fig. 4 Experimental demonstration of FACE-SPCM in a chemical reaction. [Images not available. See PDF.]
a Schematic of the microfluidic device designed for mixing different liquids, highlighting the microstructure at the confluence corner (red) and the downstream channels (blue). Scheme 1: Acetic acid (Solution 1) and baking soda (Solution 2) are used to demonstrate a chemical reaction. b Complex-field microscopy at the confluence corner. Scale bar: 50 μm. Average CNR: 4.98. c Reconstructed complex-field microscopy captured in real-time sequences over short and long temporal intervals. The entire process of the acid-base neutralization reaction is visualized, including separation, mixing, chemical reaction, and CO2 bubble formation, with detailed labeling of the chemical components. Scale bar: 100 μm. Average CNR: 4.70
As shown in Fig. 4, Scheme 1 involves a chemical reaction between acetic acid (CH3COOH) as Solution 1 and baking soda (NaHCO3) as Solution 2, resulting in an acid-base neutralization reaction that produces sodium acetate (CH3COONa) and carbon dioxide gas (CO2). Using high-precision settings, our SPCM system captured detailed amplitude and phase images at the confluence corner (red box in Fig. 4a), as shown in Fig. 4b. Due to the high flow rates at the confluence corner (2 mL h−1 for Solution 1 and 1.3 mL h−1 for Solution 2), the reaction was not fully completed at this location. Instead, the main site of reaction and mixing occurred further downstream (blue box in Fig. 4a), which was more suitable for observing the dynamics with the SPCM system. A subset of real-time complex-field imaging results is shown in Fig. 4c, with specific elapsed times indicated above each image. Because of the solutions’ transparency, amplitude imaging provides minimal detail, whereas phase imaging reveals significant contrast due to differences in optical refractive indices. Figure 4c shows CH3COOH in the central region flanked by NaHCO3 on both sides. As these solutions merged and flowed, their distinct phase boundaries gradually blurred, indicating the ongoing chemical reaction that forms CH3COONa and CO2, as evidenced by bubble formation within the mixture.
Conversely, as a control group without chemical reactions, Scheme 2 was prepared using ethanol (C2H5OH) as Solution 1 and pure water (H2O) as Solution 2, as shown in Fig. 5. Using the same microfluidic setup, the primary sites for real-time analysis with the SPCM system were at the confluence corner and the downstream channel, highlighted by the same red and blue boxes in Fig. 4a, respectively. At the confluence corner displayed in Fig. 5a, phase contrast reveals a distinct separation between the solutions due to their high flow rates, while complete mixing occurs in the downstream channel. Thus, as illustrated in Fig. 5b, this setup allows for real-time monitoring of the ethanol-water mixing process with complex-field reconstructions at specific elapsed times. Unlike the minimal changes in amplitude, the phase boundaries between C2H5OH and H2O gradually blur as mixing progresses, highlighting the difference between a homogeneous mixture and a chemical reaction. Figure 5c shows the 1D profiles along the red dotted lines from the phase images above, quantitatively demonstrating a gradual decrease in phase contrast as mixing continues, providing additional visual confirmation.
Fig. 5 Experimental demonstration of FACE-SPCM in solution mixture. [Images not available. See PDF.]
a Scheme 2: Ethanol (Solution 1) and pure water (Solution 2) are selected to illustrate a non-reactive chemical scenario. Corresponding complex-field microscopy at the confluence corner is shown. Scale bar: 50 μm. Average CNR: 4.86. b Reconstructed complex-field microscopy captured in real-time sequences over both short and long temporal intervals. The entire mixing process of ethanol and water is visualized, depicting the separation and mixing stages without any detectable chemical reactions. The involved ingredients are clearly labeled. Scale bar: 100 μm. Average CNR: 1.17. c 1D profiles of phase images across different stages, quantitatively showing the continuous evolution of the ethanol–water mixture
These results demonstrate the effectiveness of the FACE-SPCM system in observing and analyzing real-time chemical reactions and solution mixing over short durations. Additional dynamic visualizations of these chemical processes, slowed down by a factor of eight, are available in Supplementary Movie S5. The detailed preparation of the chemical solutions is described in the “Methods” section.
Discussion
The FACE-SPCM system demonstrates excellent scalability and adaptability for various dynamic conditions. Its scalability spans different imaging setups and spectral configurations, largely due to the flexibility of the FACE probing structure. For example, Imaging Scheme 2 is optimized for monitoring dynamic structures with a large FoV, whereas Imaging Scheme 3 prioritizes image quality with minimal aberrations in scenarios that do not require ultrafast observation. When scaling the system, inherent trade-offs between imaging parameters must be considered. First, as in many other imaging systems, there is a linear proportional relationship between the value of resolution and FoV. Second, an inverse linear relationship exists between frame size and FPS. For a given acousto-optic modulation bandwidth, increasing the number of pixels requires resolving a correspondingly larger number of frequency components. This, in turn, necessitates a longer measurement time series based on the principles of the Fourier transformation, thereby reducing the achievable FPS. A quantitative analysis of this compromise is provided in Supplementary Note 13. With these two trade-offs in mind, the system design can be strategically scaled to accommodate different imaging requirements. Besides, considering that computational imaging has demonstrated improvements in both resolution and FPS through compressive sensing54, we envision that by digitally redesigning the FACE pattern with the accommodation of probing structure, these concepts could also contribute to enhancing the imaging performance of FACE-SPCM in future developments.
While this study demonstrates the system at 1030 nm in the NIR range, the FACE scheme is not restricted to this wavelength. In future implementations, it can be readily adapted to other spectral ranges by modifying the excitation light source, AODs, corresponding optical components, and single-pixel detectors. Currently, commercial AOD technologies support operation across a broad spectral range, from ultraviolet to mid-infrared (380–2100 nm), while suitable light sources, optical elements, and single-pixel detectors are available over an even wider range. Further increasing the frame size while maintaining the reported FPS is currently constrained by the modulation bandwidth of the AODs. If future advancements enable AODs with GHz-level modulation bandwidth while maintaining comparable performance, the frame size could be increased by two to three orders of magnitude. In such a scenario, beat frequencies in the GHz range would need to be measured, which would exceed the capabilities of the current single-pixel detectors operating in the MHz range. To accommodate this, higher-bandwidth single-pixel detectors capable of GHz-range operation—already commercially available—would be required.
The proposed FACE scheme ensures efficiency in both spatial and temporal domains, providing high adaptability across various complex scenarios. For spatial imaging, the FACE pattern enables a wide 2D range to capture global imaging details in an ultrashort time, making it well-suited for dynamic processes beyond 1D scanning-based cytometry applications. For real-time imaging over extended periods, the system supports continuous data acquisition and processing, with storage and computation managed via a buffer at the computing terminal. This setup allows for long-term monitoring, constrained only by memory capacity. For example, storing a one-minute video at a 50 MHz sampling rate requires approximately 3 GB of data.
In summary, the developed FACE-SPCM system demonstrates exceptional real-time monitoring capabilities for complex-field microscopy in the NIR spectrum, achieving an FPS of 1000 Hz, a lateral resolution of 3.76 μm, a FoV of approximately 300 μm, and a record-high SBP-T of 1.3 × 107. A series of experimental demonstrations—including the real-time monitoring of microfluidics, live microorganisms, and chemical reactions—validate the system’s adaptability and effectiveness in capturing phase information in both transparent and scattering environments. The common-path interference structure enhances performance by minimizing environmental disturbances. The FACE scheme significantly boosts the information throughput of systems using single-pixel detectors, enhancing their capacity for real-time complex-field monitoring in the NIR spectrum. This innovative technology offers broad utility across diverse scientific and industrial applications, underscoring its transformative impact and versatility.
Materials and methods
Imaging reconstruction
The illumination framework of the FACE scheme involves a spectral-encoded projection with specified frequency tones and an unmodulated component with a base frequency . When this illumination interacts with the complex-field object , the resulting signal, carrying the encoded information, is captured by the single-pixel photodetector. The interaction between the different frequency tones and the unmodulated base frequency creates field interference, resulting in a complex heterodyne holography signal. This signal can be mathematically represented in a simplified form, as shown in Eq. 3, through sideband intensity detection. However, in practical scenarios, there are residual 1D diffractions with frequencies due to the inherent architecture. As discussed in detail in Supplementary Note 3, the polarization effects are considered, and a comprehensive form can be derived for the full characterization of the cosine term as follows:
4
We then process the signal using FFT, expressed as . Given the definition and the Dirac delta function , an updated representation in the frequency domain is developed as:
5
where DC represents the synthesis of all direct current terms during the beat frequency of heterodyne holography. The cleavage of frequency components, such as the twin terms , highlights the essence of side-band detection resulting from the cosine representation. As detailed in Supplementary Note 4, thanks to the asymmetry in the number of tones along orthogonal directions, varied frequency spacing and subtle frequency shifts jointly ensure a rigorous one-to-one spatial-frequency mapping, preventing spectral aliasing during the coherent detection. This is particularly evident in the inequality . As derived from Eq. 5, the complex-field information from the object is directly encoded in the second term. By leveraging the known beat frequency , the complex-field object can be retrieved as:
6
Time consumption in complex-field imaging reconstruction
Real-time monitoring with the FACE-SPCM system involves two key components: data acquisition and data processing, both executed in a streamlined manner. For a well-optimized configuration of the FACE scheme (e.g., Imaging Scheme 1), the data acquisition time for a single reconstruction is 1 ms. Data processing is performed in parallel with data acquisition using a central processing unit (CPU: Intel i7-13700K) and a high-speed graphics processing unit (GPU: NVIDIA RTX 4070Ti). The time required for the FFT operation is approximately 282 μs, the calibration operation takes around 36 μs, and the index extraction operation for display is about 13 μs. Consequently, the FACE-SPCM system enables real-time imaging at 1000 Hz during streamlined data acquisition and processing. The detailed workflow is provided in Supplementary Note 14.
Experimental setup
The experimental configuration of the FACE-SPCM system is schematically illustrated in Fig. 2a. A continuous-wave laser (DHNL 1030.10-50-P-N-M-FA) with an ultra-narrow bandwidth (~3 kHz at 1030 nm) served as the stable probing source up to 400 mW. A quarter-wave plate and a half-wave plate ensured proper linear polarization alignment, while a polarization beam splitter regulated the laser intensity. The laser light was directed via mirrors and expanded using lenses with focal lengths of 8 mm and 60 mm before reaching the acousto-optic deflectors (AODs) (DTSX-400-1064, AA Opto-Electronic). An iris with a 5 mm diameter filtered the expanded beam to fit the entrance of the AODs, optimizing the time-bandwidth product through two-dimensional (2D) Bragg diffraction in the FACE scheme. The time-bandwidth product, , related to the effective number of modulation branches and was proportional to the input beam diameter under efficient modulation. The AODs were driven by a tailored frequency-comb electronic signal from a function generator (DG4102, RIGOL), which was amplified by amplifiers (AMPB-B-30-10.500, AA Opto-Electronic). The frequency comb for AODx spanned 80 tones (63–88 MHz), while AODy covered 81 tones (63–88 MHz) within the allowed 61–91 MHz bandwidth. The encoding strategy, inspired by orthogonal frequency division multiplexing (OFDM) principles, is detailed in Supplementary Note 4. After mirror adjustments, the expanded beam underwent 2D Bragg diffraction via the AODs at an optimal angle (~120 mrad). AODx generated diffracted branches, which were further expanded into a 2D sub-beam array by AODy, while maintaining an unmodulated component for common-path interference. A 4f system, composed of identical lenses with a 125 mm focal length, ensured precise pattern alignment and convergence, as detailed in Supplementary Note 2. The resulting 2D sub-beam array scanned angles from 0 to 140 mrad in both the x and y directions, achieving spectral-encoded projection with distinct temporal oscillations. The relationship between frequency and angle was central to the innovative aspect of the FACE scheme. After passing through the AODs, the beam was directed through an imaging lens (7.5 mm focal length) and an acquisition lens (25 mm focal length), forming another 4f system. The imaging lens collected all beams at the confocal plane to interact with the sample for complex-field microscopy, while the acquisition lens reconverged the interacted beams at the rear focal plane of the 4f system. During imaging, the 2D sub-beam array had a relatively low power of approximately 100 μW and was projected over an area of 300 μm, resulting in an optical intensity of 110 mW cm−2, which is below the American National Standards Institute safety limit of 1000 mW cm−2 for maximum permissible exposure at near-infrared wavelengths. A 50 lp mm−1 grating was used for unified direction in common-path heterodyne holography, as described in Supplementary Note 6. The combined beam was then directed to a single-pixel photodetector (APD130C, Thorlabs) for signal collection. Real-time data acquisition was performed using a data acquisition card (ATS9462, AlazarTech), configured with a sampling rate of 50 MSa s−1.
Experimental preparation for living microorganisms and chemical reactions
The biological demonstration focuses on real-time monitoring of the living microorganism Paramecium. Paramecia were cultured in a nutrient-rich medium under well-lit conditions at temperatures of 20–25 °C and harvested after 2–3 days of growth. A small drop of the culture was collected with a pipette and placed on a clean glass slide. To maintain the paramecia’s vitality for extended observation, a coverslip was not applied over the liquid drop. For the chemical analysis, two water-based solution mixtures were monitored to determine the occurrence of chemical reactions, organized into two schemes. Scheme 1 examines the reaction between acetic acid (CH3COOH) and sodium bicarbonate (NaHCO3). Both reactants were prepared at equal concentrations to ensure effective acid-base neutralization. A 100 mL solution of 5% NaHCO₃ was prepared by dissolving 5 g of pure sodium bicarbonate powder in water and filling to the 100 mL mark in a graduated cylinder. Similarly, a 100 mL solution of 5% acetic acid was prepared by diluting a 10% acetic acid solution to the required concentration. Scheme 2 serves as a control, using a mixture of ethanol (C2H5OH) and water (H2O), where no chemical reaction occurs. For this scheme, 100 mL of absolute ethanol was prepared without water, and pure water was used as the H2O component.
Acknowledgements
We thank Dr. Can Li from the National University of Defense Technology for his significant support of this research. This work was supported in part by the National Natural Science Foundation of China (12404380, 12325408, 12274129, 12374274, 12274139, 62175069, 62175066, 62475070, 12474404), the Fundamental and Applied Basic Research Project of Guangzhou (2024A04J2001), the Guangdong Basic and Applied Basic Research Foundation (2023A1515110742, 2023B1515120044, 2024B1515020051), Shanghai Municipal Education Commission (2024AI01007), and Science and Technology Commission of Shanghai Municipality (QNKJ2024031).
Data availability
The relevant raw data and codes that produce all relative video and imaging results supporting this work are publicly available on public repository Zenodo: 10.5281/zenodo.14087848.
Conflict of interest
The authors declare no competing interests.
Supplementary information
The online version contains supplementary material available at https://doi.org/10.1038/s41377-025-01931-w.
1. Stantchev, RI et al. Noninvasive, near-field terahertz imaging of hidden objects using a single-pixel detector. Sci. Adv.; 2016; 2, 2016SciA..2E0190S e1600190.
2. Radwell, N et al. Single-pixel infrared and visible microscope. Optica; 2014; 1, pp. 285-289.2014Optic..1.285R
3. Manley, M. Near-infrared spectroscopy and hyperspectral imaging: non-destructive analysis of biological materials. Chem. Soc. Rev.; 2014; 43, pp. 8200-8214.
4. Shi, LY et al. Transmission in near-infrared optical windows for deep brain imaging. J. Biophotonics; 2016; 9, pp. 38-43.
5. Kenry,; Duan, YK; Liu, B. Recent advances of optical imaging in the second near-infrared window. Adv. Mater.; 2018; 30, 1802394.
6. Zhang, ZB; Ma, X; Zhong, JG. Single-pixel imaging by means of Fourier spectrum acquisition. Nat. Commun.; 2015; 6, 2015NatCo..6.6225Z 6225.
7. Edgar, MP et al. Simultaneous real-time visible and infrared video with single-pixel detectors. Sci. Rep.; 2015; 5, 2015NatSR..510669E 10669.
8. Sun, MJ et al. Single-pixel three-dimensional imaging with time-based depth resolution. Nat. Commun.; 2016; 7, 2016NatCo..712010S 12010.
9. Altmann, Y et al. Quantum-inspired computational imaging. Science; 2018; 361, eaat2298.
10. Meng, HY et al. Quantum dot-enabled infrared hyperspectral imaging with single-pixel detection. Light; 2024; 13, 121.
11. Sun, B et al. 3D computational imaging with single-pixel detectors. Science; 2013; 340, pp. 844-847.2013Sci..340.844S
12. Kirmani, A et al. First-photon imaging. Science; 2014; 343, pp. 58-61.2014Sci..343..58K
13. Huynh, N et al. Single-pixel optical camera for video rate ultrasonic imaging. Optica; 2016; 3, pp. 26-29.2016Optic..3..26H
14. Phillips, DB et al. Adaptive foveated single-pixel imaging with dynamic supersampling. Sci. Adv.; 2017; 3, 2017SciA..3E1782P e1601782.
15. Soldevila, F et al. Phase imaging by spatial wavefront sampling. Optica; 2018; 5, pp. 164-174.2018Optic..5.164S
16. Xu, ZH et al. 1000 fps computational ghost imaging using LED-based structured illumination. Opt. Express; 2018; 26, pp. 2427-2434.2018OExpr.26.2427X
17. Zhao, JP et al. Compressive ultrafast pulse measurement via time-domain single-pixel imaging. Optica; 2021; 8, pp. 1176-1185.2021Optic..8.1176Z
18. Liu, YF et al. Optical single-pixel volumetric imaging by three-dimensional light-field illumination. Proc. Natl Acad. Sci. USA; 2023; 120, e2304755120.
19. Bian, LH et al. High-resolution single-photon imaging with physics-informed deep learning. Nat. Commun.; 2023; 14, 2023NatCo.14.5902B 5902.
20. Wang, YQ et al. Mid-infrared single-pixel imaging at the single-photon level. Nat. Commun.; 2023; 14, 2023NatCo.14.1073W 1073.
21. Zeng, BB et al. Hybrid graphene metasurfaces for high-speed mid-infrared light modulation and single-pixel imaging. Light; 2018; 7, 51.
22. Stantchev, RI et al. Real-time terahertz imaging with a single-pixel detector. Nat. Commun.; 2020; 11, 2020NatCo.11.2535S 2535.
23. Li, JX et al. Rapid sensing of hidden objects and defects using a single-pixel diffractive terahertz sensor. Nat. Commun.; 2023; 14, 2023NatCo.14.6791L 6791.
24. Li, WL et al. Dual-color terahertz spatial light modulator for single-pixel imaging. Light; 2022; 11, 191.
25. Liang, JY et al. Spatially Fourier-encoded photoacoustic microscopy using a digital micromirror device. Opt. Lett.; 2014; 39, pp. 430-433.2014OptL..39.430L
26. Huynh, N et al. Single-pixel camera photoacoustic tomography. J. Biomed. Opt.; 2019; 24, 121907.2019JBO..24l1907H
27. Clemente, P et al. Compressive holography with a single-pixel detector. Opt. Lett.; 2013; 38, pp. 2524-2527.2013OptL..38.2524C
28. Martínez-León, L et al. Single-pixel digital holography with phase-encoded illumination. Opt. Express; 2017; 25, pp. 4975-4984.2017OExpr.25.4975M
29. González, H et al. High sampling rate single-pixel digital holography system employing a DMD and phase-encoded patterns. Opt. Express; 2018; 26, pp. 20342-20350.2018OExpr.2620342G
30. Baek, Y; Park, Y. Intensity-based holographic imaging via space-domain Kramers–Kronig relations. Nat. Photonics; 2021; 15, pp. 354-360.2021NaPho.15.354B
31. Tian, L et al. Computational illumination for high-speed in vitro Fourier ptychographic microscopy. Optica; 2015; 2, pp. 904-911.2015Optic..2.904T
32. Liu, SY; Mulligan, JA; Adie, SG. Volumetric optical coherence microscopy with a high space-bandwidth-time product enabled by hybrid adaptive optics. Biomed. Opt. Express; 2018; 9, pp. 3137-3152.
33. Higham, CF et al. Deep learning for real-time single-pixel video. Sci. Rep.; 2018; 8, 2018NatSR..8.2369H 2369.
34. Edgar, MP; Gibson, GM; Padgett, MJ. Principles and prospects for single-pixel imaging. Nat. Photonics; 2019; 13, pp. 13-20.2019NaPho.13..13E
35. Stellinga, D et al. Time-of-flight 3D imaging through multimode optical fibers. Science; 2021; 374, pp. 1395-1399.2021Sci..374.1395S
36. Jiang, WJ et al. 2,000,000 fps 2D and 3D imaging of periodic or reproducible scenes with single-pixel detectors. Photonics Res.; 2022; 10, pp. 2157-2164.
37. Wu, DX et al. Imaging biological tissue with high-throughput single-pixel compressive holography. Nat. Commun.; 2021; 12, 2021NatCo.12.4712W 4712.
38. Hahamovich, E et al. Single pixel imaging at megahertz switching rates via cyclic Hadamard masks. Nat. Commun.; 2021; 12, 2021NatCo.12.4516H 4516.
39. Kilcullen, P; Ozaki, T; Liang, JY. Compressed ultrahigh-speed single-pixel imaging by swept aggregate patterns. Nat. Commun.; 2022; 13, 2022NatCo.13.7879K 7879.
40. Goda, K; Tsia, KK; Jalali, B. Serial time-encoded amplified imaging for real-time observation of fast dynamic phenomena. Nature; 2009; 458, pp. 1145-1149.2009Natur.458.1145G
41. Diebold, ED et al. Digitally synthesized beat frequency multiplexing for sub-millisecond fluorescence microscopy. Nat. Photonics; 2013; 7, pp. 806-810.2013NaPho..7.806D
42. Wu, JL et al. Ultrafast laser-scanning time-stretch imaging at visible wavelengths. Light; 2017; 6, e16196.
43. Teng, JJ et al. Time-encoded single-pixel 3D imaging. APL Photonics; 2020; 5, 020801.2020APLP..5b0801T
44. Schraivogel, D et al. High-speed fluorescence image–enabled cell sorting. Science; 2022; 375, pp. 315-320.2022Sci..375.315S
45. Liu, JD et al. Swept coded aperture real-time femtophotography. Nat. Commun.; 2024; 15, 2024NatCo.15.1589L 1589.
46. Lei, C et al. High-throughput imaging flow cytometry by optofluidic time-stretch microscopy. Nat. Protoc.; 2018; 13, pp. 1603-1631.
47. Han, YY et al. Cameraless high-throughput three-dimensional imaging flow cytometry. Optica; 2019; 6, pp. 1297-1304.2019Optic..6.1297H
48. Zhang, ZM et al. A high-throughput technique to map cell images to cell positions using a 3D imaging flow cytometer. Proc. Natl Acad. Sci. USA; 2022; 119, e2118068119.
49. Chan, ACS et al. Arbitrary two-dimensional spectrally encoded pattern generation—a new strategy for high-speed patterned illumination imaging. Optica; 2015; 2, pp. 1037-1044.2015Optic..2.1037C
50. Feldkhun, D et al. Focusing and scanning through scattering media in microseconds. Optica; 2019; 6, pp. 72-75.2019Optic..6..72F
51. Wei, XM et al. Real-time frequency-encoded spatiotemporal focusing through scattering media using a programmable 2D ultrafine optical frequency comb. Sci. Adv.; 2020; 6, eaay1192.2020SciA..6.1192W
52. Tajahuerce, E et al. Image transmission through dynamic scattering media by single-pixel photodetection. Opt. Express; 2014; 22, pp. 16945-16955.2014OExpr.2216945T
53. Durán, V et al. Compressive imaging in scattering media. Opt. Express; 2015; 23, pp. 14424-14433.2015OExpr.2314424D
54. Lee, WB et al. COMPU-EYE: a high resolution computational compound eye. Opt. Express; 2016; 24, pp. 2013-2026.2016OExpr.24.2013L
© The Author(s) 2025. This work is published under http://creativecommons.org/licenses/by/4.0/ (the "License"). Notwithstanding the ProQuest Terms and Conditions, you may use this content in accordance with the terms of the License.