RESEARCH ARTICLE

Improved phase retrieval in holographic data storage based on a designed iterative embedded data

  • Changyu YU 1 ,
  • Suping WANG 1 ,
  • Ruixian CHEN 1 ,
  • Jianying HAO 1 ,
  • Qijing ZHENG 1 ,
  • Jinyu WANG 1 ,
  • Xianying QIU 1 ,
  • Kun WANG 1,2 ,
  • Dakui LIN 1 ,
  • Yi YANG 1 ,
  • Hui LI 1,3 ,
  • Xiao LIN , 1,4 ,
  • Xiaodi TAN , 1
Expand
  • 1. Information Photonics Research Center, College of Photonic and Electronic Engineering, Fujian Normal University, Fuzhou 350117, China
  • 2. Fujian Provincial Key Laboratory of Photonics Technology, Fuzhou 350117, China
  • 3. Key Laboratory of OptoElectronic Science and Technology for Medicine of Ministry of Education, Fuzhou 350117, China
  • 4. Wuhan National Laboratory for Optoelectronics, Wuhan 430074, China

Received date: 10 Mar 2021

Accepted date: 17 May 2021

Published date: 15 Dec 2021

Copyright

2021 Higher Education Press

Abstract

Embedded data are used to retrieve phases quicker with high accuracy in phase-modulated holographic data storage (HDS). We propose a method to design an embedded data distribution using iterations to enhance the intensity of the high-frequency signal in the Fourier spectrum. The proposed method increases the anti-noise performance and signal-to-noise ratio (SNR) of the Fourier spectrum distribution, realizing a more efficient phase retrieval. Experiments indicate that the bit error rate (BER) of this method can be reduced by a factor of one after 10 iterations.

Cite this article

Changyu YU , Suping WANG , Ruixian CHEN , Jianying HAO , Qijing ZHENG , Jinyu WANG , Xianying QIU , Kun WANG , Dakui LIN , Yi YANG , Hui LI , Xiao LIN , Xiaodi TAN . Improved phase retrieval in holographic data storage based on a designed iterative embedded data[J]. Frontiers of Optoelectronics, 2021 , 14(4) : 529 -539 . DOI: 10.1007/s12200-021-1218-0

1 Introduction

With the development of information technology, extensive data are being generated at all times. Traditional data storage technology has difficulty coping with the increasing demand for data growth [1,2]. In 1963, Van Heerden formally proposed the concept of holographic data storage (HDS) [3]. HDS technology, as a three-dimensional volume storage technology, has garnered renewed attention because of its ultra-high storage density and ultra-fast data transfer rate [46].
Traditional HDS technology uses amplitude modulation, but the code rate is relatively low. In contrast, phase-modulated HDS can increase the code rate [7,8] and the recording signal-to-noise ratio (SNR), which is conducive to further increasing the storage density; however, the camera cannot read the phase information directly. Typically, interferometric and non-interferometric phase-retrieval methods are used to decode the phase information. However, the interferometric phase decoding method suffers from problems, such as system complexity and instability of detection results [911], whereas non-interferometric phase-retrieval methods usually have a simpler and more stable detection system [12]. There exist many non-interferometric phase-retrieval methods, such as the ptycholographic iterative engine (PIE) algorithm [13,14], transport-of-intensity equation (TIE) [15,16], and iterative Fourier transform (IFT) algorithms [17,18]. This study employs the IFT algorithm to retrieve phase owing to its simpler and more stable system and single-shot operation.
The traditional IFT algorithm requires a multitude of iterations; thus, to reduce the number of iterations, the addition of available embedded data is proposed to enhance the constraint condition [19]. In actual experiments, iterations are typically more than simple simulations because the noise generated by the optical system and electrical components affect the Fourier intensity distribution [20]. The Fourier intensity distribution includes high and low-frequency information. Although the high-frequency information of the Fourier intensity aids in more accurate and faster phase retrieval, its intensity is weaker, and the noise immunity is worse than low-frequency information. Therefore, this paper proposes a method to encode embedded patterns to improve the SNR of high-frequency information in the Fourier intensity distribution and thus further increase the efficiency of phase retrieval.

2 Theory and methods

The model of the HDS and phase-retrieval system with embedded data is shown in Fig. 1.
Fig.1 Diagram of holographic data storage (HDS) and phase-retrieval system. SLM: spatial light modulator, CCD: charge-coupled device

Full size|PPT slide

In Fig. 1, the signal phase pattern is composed of unknown and embedded phase data. Unknown data refers to the valid information that needs to be recorded. Embedded data are known data, which can accelerate the phase reconstruction efficiency. In the recording process, the signal phase pattern was uploaded on the spatial light modulator (SLM) as the signal beam interferes with the reference beam, thereby recording a hologram in the media. Thereafter, in the reading process, the same reference beam illuminates the hologram, and the reconstructed beam is diffracted. Furthermore, to read out the phase information of the reconstructed beam, a single lens was used, and the charge-coupled device (CCD) captured the Fourier intensity distribution of the reconstructed beam. Finally, by using the IFT algorithm [12], the unknown phase data in the signal phase pattern was retrieved.
From previous studies, we discussed the proportion of embedded data, where 50% of the embedded data are appropriate [12]. Therefore, we used the 4-level phase data shown in Fig. 2(a) as the signal phase pattern, which is composed of 50% of unknown and embedded phase data each, as shown in Figs. 2(b) and 2(c), respectively. The Fourier intensity distribution captured by the CCD and its three-dimensional distribution is shown in Figs. 2(d) and 2(e), respectively. The red curve in Fig. 2(e) represents the intensity envelope in the Fourier plane.
Fig.2 Process of generating intensity distribution in the Fourier plane. (a) Signal phase pattern. (b) Unknown phase data. (c) Embedded phase data. (d) Intensity in the Fourier plane. (e) Three-dimensional intensity distribution in the Fourier plane

Full size|PPT slide

In Fig. 2, it is intuitively evident that the high-frequency information intensity is relatively low compared to the low-frequency information intensity. The experiments performed indicated that the noise in Fourier intensity gradually decays from low to high-frequency, and thus a similar noise was added to the simulation, as shown in Fig. 3(a). The Fourier intensities before and after the addition of noise are shown in Figs. 3(b) and 3(c), and the corresponding three-dimensional envelope distributions are shown in Figs. 3(d) and 3(e). Furthermore, the calculated value of the SNR of the low-frequency part (in the red frame) is 7.52, and that of the high-frequency part (outside the red frame) is 0.38, which renders the high-frequency information susceptible when there is noise. High-frequency information can aid in retrieving the phase quicker with high accuracy; therefore, the aim of our method is to increase the intensity of the high-frequency information by encoding the embedded phase pattern.
Fig.3 Fourier intensity and its corresponding three-dimensional envelope distribution with or without noise. (a) Gaussian model noise. (b) and (d) Without noise. (c) and (e) With noise

Full size|PPT slide

The corresponding relationship between the phase pattern and the envelope intensity distribution is shown in Fig. 4.
Fig.4 Effect of changing the embedded phase pattern on the Fourier plane intensity distribution envelope. (a) Case before the change. (b) Case after the change

Full size|PPT slide

Figures 4(a1)−4(a3) represent the unknown, previous embedded, and combined overall signal phase patterns, respectively. The envelope intensity distributions corresponding to the three-phase patterns are the red, blue, and solid yellow lines in Fig. 4(a), respectively. Figures 4(b1) and 4(b2) represent the new embedded and combined overall new signal phase patterns, respectively, and their corresponding envelope intensity distributions are represented by the blue and yellow dotted lines in Fig. 4(b), respectively.
In the actual experiment, we used a 4-level phase (0, π/2, π, 3π/2) containing 50% embedded data. Moreover, oversampling should be used to avoid the phase gap effect of adjacent pixels on the SLM. Typically, one phase of data should be displayed by 4 × 4 pixels on the SLM, such that the oversampling rate is 4. The spatial frequency on the SLM plane is the same for the unknown and previous embedded phase patterns. Therefore, their envelope intensity distributions are identical, as shown in Fig. 4(a), for the red and blue solid lines. We aim to design an envelope with a high-frequency, such as the blue dotted line shown in Fig. 4(b), and determine a new embedded phase pattern corresponding to it, as shown in Fig. 4(b1). The space-frequency spectrum of embedded data we designed needs to be higher than that of the phase data page to improve the high-frequency component of the overall Fourier spectrum.
Because the Fourier intensity envelope is related to the phase pattern, after designing the intensity envelope about the embedded phase pattern, the embedded phase pattern itself can be obtained by using the IFT algorithm. Therefore, in our method, the embedded phase pattern is not assigned directly but is generated by iterative computing according to the designed intensity envelope of the embedded data.
The method to design a new embedded data phase pattern is as follows.
We used MATLAB software to generate a three-dimensional embedded data envelope intensity distribution with higher high-frequency as the constraint condition, as shown in Fig. 5(a).
Through the IFT algorithm, the previous embedded envelope intensity distribution, as shown in Fig. 5(b), was constrained and fitted to the designed three-dimensional envelope intensity distribution gradually, as shown in Fig. 5(a). Fitting is an iterative process; thus, typically, 20 iterations were required to complete the fitting, and in each iteration, a new embedded data envelope intensity distribution was obtained, as shown in Figs. 5(c) and 5(d), and new embedded data pattern corresponding to the envelope intensity distribution was recorded, to finally obtain 20 different embedded data patterns, as shown in Figs. 5(f) and 5(g). Our purpose was to determine the embedded phase pattern corresponding to the optimal envelope intensity distribution. By combining different new embedded data patterns with the same unknown data pattern, a series of different Fourier intensity distributions can be obtained, and thereafter, using the IFT algorithm [12], the phase retrieval results of different new embedded data patterns can be compared to determine the best new embedded data pattern.
Fig.5 Schematic diagram of generating new embedded data. (a) Constrained envelope distribution. (b) Previous embedded data envelope and its corresponding (e) embedded data pattern. (c) and (d) Envelope distribution under different iteration times and its corresponding (f) and (g) new embedded data patterns

Full size|PPT slide

Different new embedded data patterns affect the efficiency of phase retrieval. The specific algorithm for generating new embedded phase patterns is as follows:
In the first step, we consider an initial guess phase distribution of the embedded data ϕn, n = 1, 2, 3,..., which represents the number of iterations, and the position of the embedded data remains unchanged during the iteration. This results in a complex amplitude distribution Un in the object domain, as shown in Eq. (1).
Un=exp( i* φn).
This is followed by the second step, wherein we obtain a complex amplitude distribution Vn on the Fourier domain after the Fourier transform, as shown in Eq. (2).
Vn={ Un }=|An|*exp(i * Φn) ,
where { } represents the Fourier transform operator, and An and Φ n represent the amplitude and phase, respectively.
Further, in the third step, we use the square root of the designed envelope intensity I to replace the amplitude An; thus, a new distribution Vn ' can be obtained, as shown in Eq. (3).
Vn'= |I|*exp (i*Φn ).
Thereafter, in the fourth step, we continue to go through the inverse Fourier transform to obtain the new complex amplitude distribution Un' on the object domain, as shown in Eq. (4).
Un'= 1{Vn'}=|A n'|*exp(i*φn'),
where 1{ } represents the inverse Fourier transform operator, and An ' and ϕn' represent the new amplitude and phase, respectively.
Finally, in the fifth step, phase-only is used as a constraint condition to rectify the complex amplitude distribution. Therefore, the amplitude was normalized to 1, resulting in a new distribution Un'', as shown in Eq. (5).
Un''=exp (i*φn ''),
where φn '' is the new embedded phase pattern.
The afore described steps were repeated several times, and after each iteration, a new embedded data phase pattern was obtained; therefore, after several iterations, a series of different new embedded data phase patterns can be obtained.

3 Simulation and experimental results

We used a 32 × 32 signal phase pattern randomly with 4-level phase (0, π/2, π, 3π/2), which contains 50% embedded data shown in Fig. 6(a), and each data was displayed in a 4 × 4 pixel block on the SLM with a pixel pitch of 20 μm. The pixel size of the CCD is 5.86 μm. According to the Nyquist sampling theorem, we chose two Nyquist size frequencies in the Fourier intensity shown in Fig. 6(b) for phase retrieval.
Fig.6 (a) Signal phase pattern and (b) Fourier intensity with two Nyquist size frequencies

Full size|PPT slide

According to the flow shown in Fig. 5, we obtained new embedded data. To distinguish the previously embedded data from the new embedded data, they were named as the assigned embedded data and the iterative embedded data, respectively. The difference in the phase patterns between these two embedded data is shown in Fig. 7. Compared with the assigned embedded data, the iterative embedded data are more complex and finer because space complexity provides more high-frequency components.
Fig.7 Schematic diagram of phase pattern of object plane. (a) Unknown data. (b) Assigned embedded data. (c) Assigned signal phase pattern. (d) Iterative embedded data. (e) Iterative signal phase pattern

Full size|PPT slide

Fig.8 Fourier intensity distributions corresponding to (a) assigned embedded data and (c) iterative embedded data, (b) and (d) are the enlargement of red boxes in (a) and (c), respectively

Full size|PPT slide

Figure 8 shows the Fourier intensity distributions corresponding to the assigned embedded and iterative embedded data. It is evident that the details of the high-frequency information in Fig. 8(d) are more abundant than those in Fig. 8(b). Following this, we also evaluated the intensity of the high- and low-frequency signals in the Fourier plane from two perspectives. It was found that when the high-frequency intensity increased, the ratio of the high-frequency intensity to the total intensity increased; whereas, when the low-frequency intensity decreased, it resulted in a decrease in the ratio of the low-frequency intensity to the total intensity.
One Nyquist size-frequency is denoted as w, and we obtained two Nyquist size frequencies in the Fourier intensity, as shown in Fig. 9(a). Further, we calculated the average intensity of the two Nyquist size frequencies to be Q.
Furthermore, we calculated the intensity of the high-frequency information. The size of the yellow box gradually reduced from 1.9 to 1.5, and to express our choice of ideas more vividly; we only chose three yellow boxes of different sizes as the explanation. The yellow boxes in Figs. 9(b)–9(d) show three different size windows: 1.9 w, 1.7w, and 1.5 w. Only the values outside these windows were retained, and thereafter the average intensity P was calculated.
In contrast, we calculated the intensity of the low-frequency information. The size of the red box was gradually increased from 0.1 w to 0.5w, and similarly, three windows were selected as an illustration. The red boxes in Figs. 9(e)–9(g) are three different size windows of 0.1 w, 0.3w, and 0.5 w. Similarly, only the values inside these windows were retained, followed by the calculation of the average intensity R.
We calculated the proportional intensity coefficients E1 and E2 to evaluate the intensity distribution of the high- and low-frequency regions according to Eq. (6).
E1 = PQ, E2= RQ.
Fig.9 Evaluate the intensity of the high-frequency and low-frequency. (a) 2w. (b) 1.9w. (c) 1.7w. (d) 1.5w. (e) 0.1w. (f) 0.3w. (g) 0.5w

Full size|PPT slide

In the afore described embedded data algorithm, by controlling the number of iterations, we can obtain different embedded data envelope distributions corresponding to different iterative embedded data patterns, and thereby obtain a series of overall Fourier intensity distributions composed of different iterative embedded data patterns along with the same unknown data pattern. In Fig. 10, we selected three cases as illustrations.
In case 1, iterative embedded data are used to reduce the high-frequency Fourier intensity. Whereas in case 2, the high-frequency intensity was slightly greater than that in case 1 by using the assigned embedded data. Finally, in case 3, the iterative embedded data was changed to render the high-frequency of the Fourier intensity slightly greater than that in case 2. Figures 10(a) and 10(b) show the proportional intensity coefficient curves corresponding to the calculation results of E1 and E2 in Eq. (6), respectively. A high value of E1 along with a low value of E2 indicates greater high-frequency information intensity, and vice-versa. E1 and E2 represent the same conclusions. However, the differences among the E1 values were small, whereas the differences among the E2 values were larger. The latter is more conducive to distinguishing different Fourier intensities; thus, we chose E2 under 0.1 times the Nyquist size and used it to express the following calculation.
Fig.10 Proportional intensity coefficients under different Nyquist spectrum. (a) E1. (b) E2

Full size|PPT slide

In the absence of noise, the assigned embedded and iterative embedded data are approximately the same for the phase retrieval because the SNR of high-frequency intensity does not decrease without noise, therefore strengthening of the high-frequency intensity is not required; thus, we only discuss the impact of the two forms of embedded data on phase retrieval in the presence of noise.
Noise exists in every process in the HDS system and deteriorates the phase retrieval directly. To simulate the influence of the experimental noise in the simulation, we added a certain amount of Gaussian model noise to the Fourier intensities formed by the assigned embedded data and iterative embedded data methods. The Gaussian model noise is shown in Fig. 3(a).
Thereafter, we used the IFT algorithm to calculate the bit error rate (BER) after 10 iterations with different Fourier intensities and determined the E2 value corresponding to the minimum BER as shown in Fig. 11, and thus we determined the corresponding embedded data pattern at this time. Smaller the E2 value, lower the low-frequency and higher the high-frequency. In Fig. 11, the black curve and red dot represent the phase-retrieval results corresponding to different iterative embedded and assigned embedded data patterns, respectively. When E2 was 6.00, the BER of the iterative embedded data method was 0.033, while for E2 of 8.31, and the BER was 0.064 in the case of the assigned embedded data method. It can be seen that phase retrieval is better when using the iterative embedded data method.
Fig.11 Bit error rate (BER) curves under different Fourier intensities

Full size|PPT slide

In Fig. 12, we select two cases where the value of the assigned embedded data E2 is 8.31, and the value of the iterative embedded data E2 is 6.00, and compare the corresponding BER curves. When we use the iterative embedded data method, the phase retrieval is better because the E2 value of the iterative embedded data method is lower than that of the assigned embedded method, which implies that the high-frequency intensity in the iterative embedded data method is higher. Under the same noise, the SNR of the assigned embedded data method was 6.0, while that of the iterative embedded data method was 6.6. Therefore, the convergence speed of the BER using the iterative embedded data method was faster.
Fig.12 Phase-retrieval results of simulation in different embedded data

Full size|PPT slide

The experimental setup of the non-interference phase-retrieval system is shown in Fig. 13.
Fig.13 Experimental setup. SF: spatial filter, HWP: half-wave plate, BS: beam splitter, SLM: spatial light modulator, L1L6: lens (L1 = 300 mm, L2L5 = 150 mm, L6 = 75 mm)

Full size|PPT slide

We used the same parameters as those in the simulation. The wavelength of the laser was 532 nm, and its power was 300 mW. The SLM is a HAMAMATSU X10468-04 with a pixel pitch of 20 μm and a resolution of 792 × 600 pixels. The CCD is Thorlabs' DCC3260M, with pixel pitch of 5.86 μm, and resolution of 1936 × 1216 pixels. The media used was a PMMA photopolymer. The phase pattern was 32 × 32 phase data based on the combination of half embedded and half unknown phase data, and we used a block of 4 × 4 pixels to denote one-phase data. Finally, a 128 × 128 pixel phase pattern was loaded onto the SLM.
In the experiment, the Fourier intensity distributions corresponding to the assigned embedded data method and the iterative embedded data method are shown in Fig. 14.
Fig.14 Fourier intensity distribution in different embedded data. (a) Iterative embedded data. (d) Assigned embedded data. (c) and (f) are part of the high-frequency detail information in (b) and (e), respectively

Full size|PPT slide

It was experimentally verified that different intensity distributions were captured on the CCD. It is evident that the high-frequency detail information in Fig. 14(c) is more abundant. The phase-retrieval results corresponding to the assigned embedded data and the iterative embedded data are shown in Fig. 15.
Fig.15 Phase-retrieval results using different embedded data in the experiment

Full size|PPT slide

In Fig. 15, after ten iterations, the iterative embedded data method can reduce the BER by approximately a factor of one. This proves that our proposed method of using embedded data to enhance the high-frequency intensity aids in improving phase retrieval.

4 Conclusions

In this study, we used the iterative embedded data method to increase the Fourier high-frequency intensity compared to the assigned embedded data method. When considering noise, the iterative embedded data method can reduce the BER by a factor of one, and the function of the known embedded data is further expanded, thus achieving faster retrieval of phase information.

Acknowledgements

This work was partially supported by the Open Project Program of Wuhan National Laboratory for Optoelectronics (No. 2019WNLOKF007) and the National Key R & D Program of China (No. 2018YFA0701800).

Disclosures

The authors declare no conflicts of interest.
1
Reinsel D, Gantz J, Rydning J. The Digitization of the World−From Edge to Core. IDC & Seagate: IDC White Paper, 2018

2
Haw M. Holographic data storage: the light fantastic. Nature, 2003, 422(6932): 556–558

DOI PMID

3
Van Heerden P J. Theory of optical information storage in solids. Applied Optics, 1963, 2(4): 393–400

DOI

4
Horimai H, Tan X, Li J. Collinear holography. Applied Optics, 2005, 44(13): 2575–2579

DOI PMID

5
Lin X, Hao J, Zheng M, Dai T, Li H, Ren Y. Optical holographic data storage-the time for new development. Opto-Electronic Engineering, 2019, 46(3): 180642

6
Lin X, Liu J, Hao J, Wang K, Zhang Y, Li H, Horimai H, Tan X. Collinear holographic data storage technologies. Opto-Electronic Advances, 2020, 3(3): 19000401

7
John R, Joseph J, Singh K. Holographic digital data storage using phase-modulated pixels. Optics and Lasers in Engineering, 2005, 43(2): 183–194

DOI

8
Psaltis D, Levene M, Pu A, Barbastathis G, Curtis K. Holographic storage using shift multiplexing. Optics Letters, 1995, 20(7): 782–784

DOI PMID

9
He M, Cao L, Tan Q, He Q, Jin G. Novel phase detection method for a holographic data storage system using two interferograms. Journal of Optics. A, Pure and Applied Optics, 2009, 11(6): 065705

DOI

10
Jeon S H, Gil S K. 2-step phase-shifting digital holographic optical encryption and error analysis. Journal of the Optical Society of Korea, 2011, 15(3): 244–251

DOI

11
Xu X F, Cai L Z, Wang Y R, Meng X F, Zhang H, Dong G Y, Shen X X. Blind phase shift extraction and wavefront retrieval by two-frame phase-shifting interferometry with an unknown phase shift. Optics Communications, 2007, 273(1): 54–59

DOI

12
Lin X, Huang Y, Shimura T, Fujimura R, Tanaka Y, Endo M, Nishimoto H, Liu J, Li Y, Liu Y, Tan X. Fast non-interferometric iterative phase retrieval for holographic data storage. Optics Express, 2017, 25(25): 30905–30915

DOI PMID

13
Pan X, Liu C, Lin Q, Zhu J. Ptycholographic iterative engine with self-positioned scanning illumination. Optics Express, 2013, 21(5): 6162–6168

DOI PMID

14
Gureyev T E, Roberts A, Nugent K A. Phase retrieval with the transport-of-intensity equation: matrix solution with use of Zernike polynomials. Journal of the Optical Society of America. A, Optics, Image Science, and Vision, 1995, 12(9): 1932–1942

DOI

15
Gureyev T E, Nugent K A. Rapid quantitative phase imaging using the transport of intensity equation. Optics Communications, 1997, 133(1–6): 339–346

DOI

16
Volkov V V, Zhu Y, De Graef M. A new symmetrized solution for phase retrieval using the transport of intensity equation. Micron (Oxford, England), 2002, 33(5): 411–416

DOI PMID

17
Fienup J R. Phase retrieval algorithms: a comparison. Applied Optics, 1982, 21(15): 2758–2769

DOI PMID

18
Fienup J R. Reconstruction of a complex-valued object from the modulus of its Fourier transform using a support constraint. Journal of the Optical Society of America. A, Optics and Image Science, 1987, 4(1): 118–123

DOI

19
Lin X, Huang Y, Li Y Y, Liu J, Liu J, Kang R, Tan X. Four-level phase pair encoding and decoding with single interferometric phase retrieval for holographic data storage. Chinese Optics Letters, 2018, 16(3): 032101

DOI

20
Liu J, Zhang L, Wu A, Tanaka Y, Shigaki M, Shimura T, Lin X, Tan X. High noise margin decoding of holographic data page based on compressed sensing. Optics Express, 2020, 28(5): 7139–7151

DOI PMID

Outlines

/