Diamond magnetometry and gradiometry towards subpicotesla DC field measurement Chen Zhang 0 Farida Shagieva 7 Matthias Widmann 0 Michael Ku¨bler 0 Vadim Vorobyov 0 Polina Kapitanova 2 Elizaveta Nenasheva 4 Ruth Corkill 5 Oliver Ro¨hrle 5 Kazuo Nakamura 6 Hitoshi Sumiya 1 Shinobu Onoda 8 Junichi Isoya 3 J¨org Wrachtrup 0 3rd Institute of Physics, University of Stuttgart , Pfaffenwaldring 57, Stuttgart 70569 , Germany Advanced Materials Labotratory, Sumitomo Electric Industries, Ltd. , Itami 664-0016 , Japan Department of Physics and Engineering, ITMO University , Saint Petersburg 197101 , Russia Faculty of Pure and Applied Sciences, University of Tsukuba , Tsukuba 305-8573 , Japan Giricond Research Institute, Ceramics Co. Ltd. , Saint Petersburg 194223 , Russia Institute for Modelling and Simulation of Biomechanical Systems , Paffenwaldring 5a, Stuttgart 70569 , Germany Leading-Edge Energy System Research Institute, Fundamental Technology Dept., Tokyo Gas Co., Ltd. , Yokohama 230-0045 , Japan TTI GmbH / TGZ SQUTEC , Nobelstraße 15, Stuttgart 70569 , Germany Takasaki Advanced Radiation Research Institute, National Institutes for Quantum and Radiological Science and Technology , Takasaki 370-1292 , Japan -

tiple diamond DC magnetometry methods are investigated. Pulsed schemes with continuous excitation and readout are demonstrated with a better sensitivity potential than optimized CW-ODMR measurements. On the other hand, we also show that the CW-ODMR driving both hyperfine and double resonance (DR) transitions demonstrate optimized sensitivity close to Ramsey measurements with the same low-power laser excitation.

We present a measurement showing the minimum detectable field at 2 − 3 pT within the bandwidth of 0 − 200 Hz in 73 s measurement time, and the 1 Hz normalized noise spectral density is 17 pT/√Hz. We also present a gradiometry measurement, showing the minimum detectable differential field noise at 4 − 6 pT with the 73 s measurement time. Furthermore, by applying a ferrite flux guide (FG), the measured minimum detectable field goes down to subpicotesla at 0.3 − 0.7 pT over the mentioned bandwidth, corresponding to a sensitivity level at 2.6 − 6 pT/√Hz.

II.

EXPERIMENT

The optimization of diamond magnetometry sensitivity has been reviewed in detail recently [ 31 ]. NV− magnetometry is based on the Zeeman shift of its ground state sublevels. The S = 1 spin can be initialized by green laser pumping, and the triplet ground state can be resonantly driven by microwaves (MW). The typical way of measuring the Zeeman shift is to detect the fluorescence change induced by the sublevel population difference.

The intrinsic noise of the magnetometer can be defined as δB = σ(t)/(dS/ dB), where σ(t) is the detected noise floor excluding the external field noise, and the scalar factor dS/ dB is the change in signal dS occurring per magnetic field change dB. In NV− ensemble magnetometry, the noise characteristic is essentially determined by the characteristics of the NV− ensembles. The minimum detectable field scales with the sensing volume as well as the measurement time [ 4 ]. In this work, by applying an optimized diamond sample with narrow linewidth NV− ensembles, we mainly focus on compromising the signal level and the technical noise and feasibility.

Figure 1 shows the experimental setup and the energy level diagram for the photo- and spin kinetic simulations [ 32 ]. A magnetic shield cube entirely encloses the setup to attenuate the magnetic field noise in the lab. The optical compound parabolic concentrator (CPC) is used for high fluorescence collection efficiency, which exceeds 60% [ 4 ]. The dielectric resonator antenna generates a uniform driving MW field for the NV ensemble [ 33 ]. We investigate the performance of a gradiometer by constructing two identical diamond sensors, as shown in Fig. 1(b).

A loop antenna is applied in the reference channel. In order to test the sensitivity enhancement by the FG, a ferrite rod (MN60) is placed between the diamond and a coil applying the test field, as shown in Fig. 1(c). The diameter of the FG tip is 2 mm. The gap between the FG and the diamond is roughly 1 mm, limited by the CPC lens structure. The FG-assisted enhancement could be higher with its decreasing distance to the diamond.

One of the most critical techniques in sensitive measurements is lock-in detection to avoid low-frequency 1/f noise. Figure 2(a) shows the typical CW-ODMR output from the lock-in amplifier (LIA), with the NV centers spin transitions driven by a modulated MW field.

The MW modulation phases are of opposite sign for the |0i → |+1i and |0i → |−1i transitions [ 30 ]. Thus, the signals S−1 and S+1, which are induced by the commonmode line shift δcm, have the same amplitude but different signs. The common-mode line shifts are induced mainly by thermal fluctuations. By tracking the resonance lines in cases of larger magnetic fields, one can maintain the maximum scalar factor and the linearity of the system.

Figure 2(b) shows a Ramsey sequence as an example for pulsed measurement sequences using a LIA, while Fig. 2(c) depicts the signal dynamics together with the acquisition methods. In the up to now used Ramsey measurements [ 2 ], the laser is turned off during the field acquisition time τm. The fluorescence signal is therefore gated and appears as pulses. However, the signal dS only amounts to a few percent of the overall signal. The background fluorescence consumes most of the bits of the digitizer when using normal photodetectors. An alternative could be to use a balancing detector. However, when the two channels are not perfectly balanced due to stability reasons, the left pulsed signals still induce overshoot and delay errors in the electronics. Besides, the thermal dynamics induced by the laser pulses contribute to additional errors in the measurement. Thus, we have modified the sequence to a continuous excitation mode with the laser turned on all the time, as shown in Fig. 2(b). The fluorescence during τm and MW pulses time can be neglected due to the low repolarization rate under the low-power laser excitation. One of the apparent advantages of using continuous excitation is the simplicity of the laser optics, e.g., an optical switch is no longer required even for MW pulsed measurements. By simply applying AC coupling to the digitizer, any background signal can be removed without noises induced by the pulsed signals. Moreover, the photodetectors do not need wide bandwidth under continuous excitation for the high dynamic pulse signals so that the gain could increase for a better signal-noise ratio (SNR). Heating fluctuation induced by the pulsed laser can be removed, and thermal stability can be improved by monitoring the laser power drifts for compensation. The disadvantage is that the laser repolarizes the NV center during τm, which reduces dS. In a quantitative treatment of the measurements, this is treated as a lower polarization rate, i.e., it reduces the contrast a bit but does not induce error to the phase accumulation. We would like to note that the continuous excitation mode can also be applied in other pulsed measurement schemes, such as Hahn-echo and high order dynamical decoupling sequences. The black curve in Fig. 2(c) is the fluorescence signal for a cycle time of 1 ms. The labeled grey areas and the inset formulas describe the acquisition method which used in previous work [ 4 ]. For the sensitivity calculation using the signal S, we need to consider the scaling of noise with the factor pTseq/Δt, where Tseq is the sequence time, and Δt is the total acquisition time. On the other hand, the lockin detection is equivalent to a method, in which SA and SB are the averaged signal level of each half reference cycle, and S is the amplitude of the entire cycle. The LIA integrates all the detected fluorescence so that there is no acquisition time intermittency, which deteriorates the sensitivity. Therefore, the measured minimum detectable field, i.e., shot noise of the Ramsey measurement can be expressed as [ 2, 31 ] where τm is optimized from the dephasing time T2∗, Cdet is the detected contrast, N is the number of photons collected per measurement, and t is the measurement time.

Experimentally N can be estimated by the photon detection rate R = N/Tseq. The sensitivity is calculated by η = δB√t. The equation calculates the shot noise without explicitly including the photon detection efficiency.

When the laser power is not saturating the NV ensembles, as in our case of low optical pumping, the collected photon rate is also related to the pumping power. In Fig. 2(c) the dotted line shows the fluorescence signal with 250 μs cycle time, and the dashed line is the corresponding lock-in reference. When the time duration for each sequence part is significantly shorter than the repolarization time of the spins, the contrast reduces dramatically due to insufficient repolarization. The contrast increases with higher pumping power when polarization time Tp is fixed. In this case, Tseq can be shorter to make the magnetometer more sensitive. However, latterly we will see that laser power of Watts is required to get a significant improvement of sensitivity. In order to make it more clear in the model, we denote the contrast as C(P, Tp), where P is the applied laser power. Note that the contrast also decays with the phase factor exp (−τm/T2∗), the detected contrast Cdet = C(P, Tp) exp (−τm/T2∗). Meanwhile, improving the laser power leads to higher fluorescence signals due to more optical cycles, and it also contributes to a linear rising in laser noise. It becomes rather complicated to get better sensitivity in practical then. Thus, we optimize with a fixed low laser power for the trade-off between the technical noise floor and the signal contrast, which both decrease with higher demodulation frequencies.

Among all the diamond DC magnetometry schemes, Ramsey magnetometry is among the most sensitive methods. However, its practical sensitivity suffers from technical limitations. For example, enormous laser power is needed for the initialization of the spin ensembles. It brings heating problems and extra noise with it. In contrast, CW-ODMR is a method that can be easily implemented with low laser power. In the following, we first analyze the CW-ODMR sensitivity and the optimal laser and MW parameters. Then, Ramsey magnetometry is investigated based on the same laser power. The shot noise measured by the CW-ODMR method is described as [ 34 ] δBCW = PF h ν(s, ΩR)

√ gμB Cdet(s, ΩR) Rt .

(2) Here PF is a lineshape dependent factor, which is 0.77 for the Lorentzian line profile in this work. The critical parameters for experimental optimization are the linewidth ν and the contrast Cdet, which are dependent on the Rabi frequency ΩR, and the ratio of the applied laser power to the saturation power, s. Figure 3(a) plots the sensitivity dependence on the parameters. T1 and T2∗ used in the calculation are 6 ms and 8.5 μs, respectively. The optimized magnetometry parameters are ΩR = 23 kHz, and s = √3× 10−4. The shot noise limited sensitivity is 2.86 pT/ Hz for the signal of a single hyperfine line. By driving the three hyperfine lines, the contrast ideally increases three times. However, the contrast enhancement factor in the experiment is smaller because of reasons such as the near-resonant driving induced by the multiMW-frequencies. The effect could be limited by applying weak MW driving. In our experiments, the contrast enhancement factor is 2.67. The scalar factor is further improved by DR driving, and the experimental enhancement factor is 1.3 times. Therefore, the sensitivity limit of the CW-ODMR method upon driving all the hyperfine lines and using DR driving is expected to be 0.82 pT/√Hz. The estimated sensitivity limit based on calculations in Fig.3(a) does not include the influence of MW modulation. MW modulation reduces the contrast due to the magnetometer frequency response, depending on the optical pumping. In the following, we will discuss the magnetometry parameters at the near optimized optical pumping power.

In order to have an optimized SNR, the collected fluorescence emission power should be on the order of mW with roughly 3 × 10−4 lower than the saturation laser power. In order to avoid applying an unnecessarily high laser power, the excitation power is optimized by comparing the experimental pumping rate with the simulation of the excitation kinetics. With a fluorescence emission rate of 66 MHz [ 35, 36 ], the optimized pumping rate is estimated to be around 0.02 MHz. This parameter underestimates the pumping rate due to neglecting charge state conversion in the kinetics [ 37, 38 ]. We apply a near-optimal laser power with the pumping rate k = 0.026 MHz. Figure 3(b) plots the experimental fluorescence recovery curve and the simulated results with different parameters. The low pumping rate is the main reason for the millisecond repolarization time. Hence, in the low pumping rate case, the influence from T1 appears on the fluorescence recovery. We take the parameter linewidth/contrast to optimize the MW power, as plotted in Fig. 3(c). We find the optimal ΩR to be around 17 kHz, while the applied laser power is roughly 80 mW with the beam size same as the diamond dimension of 0.5 mm. In order to investigate the contrast response to the MW modulation frequency, i.e., the magnetometer bandwidth, the calculated fluorescence responses to different MW modulation frequencies are plotted in Fig. 3(d). The demodulated output magnitude of the fluorescence drops from 1 kHz to 20 kHz modulation frequency by roughly 10 times due to the optical pumping dynamics and T1. Thus, with the MW modulation frequency higher than 1 kHz, the optimal sensitivity also degrades accordingly. The 3 dB bandwidth of the magnetometer is around 1.5 kHz at a pumping rate of 0.026 MHz.

It increases to 7.4 kHz with a lower contrast when the pumping rate is 10 times higher. In addition, with a roughly 10 times shorter T1, the contrast also decreases.

However, the 3 dB bandwidth only increases to 1.7 kHz.

The SNR can be improved by MW modulation within the 3 dB bandwidth since the technical noise floor decreases with the increasing demodulation frequency. After the 3 dB frequency point, it is worth investigating if the SNR can be further improved by reducing both the signal amplitude and the noise, which is discussed below.

RESULTS AND DISCUSSIONS

Figure 4(a) shows a comparison of the linewidth between CW-ODMR, pulsed-ODMR, and continuously excited pulsed-ODMR (CE-pulsed-ODMR) at different Rabi frequencies. The diamond demonstrates a narrow CW-ODMR linewidth of 28 kHz FWHM with a weak MW driving, while the calculated minimum linewidth of the diamond with T2∗ = 8.5 μs is < 20 kHz [ 34 ]. The CWODMR shows the linewidth broadening with strong MW fields. While the CW-ODMR linewidth reduces with the MW power, the linewidths of the pulsed-ODMR and the CE-pulsed-ODMR measurement stay unchanged when the π pulse width exceeds T2∗. Figure 4(a) also shows that pulsed-ODMR and CE-pulsed-ODMR are having almost the same linewidth. In order to determine whether the contrast reduction in CE-pulsed magnetometry degrades the sensitivity, we optimize the CW-ODMR sensitivity as a benchmark for comparison with the sensitivity of the CE-Ramsey measurement. We expect that CE-Ramsey combines the narrow linewidth with the advantage of a good SNR due to the continuous readout. All the following measurements are based on the near-optimal laser power for the CW-ODMR method, which can be easily achieved in the laboratory. Both the simulations and experimental results show that CE-Ramsey and regular Ramsey measurement yields similar results with the lowpower laser [ 32 ].

MW angular modulation is implemented as discussed in Sec. II. Either frequency modulation (FM) or phase modulation (PM) can be applied. Taking the FM MW signal as the example, the signal can be described as

SF M = A cos[2πf0t + β sin(2πfmt)],

(3) where f0 is the resonance frequency, fm is the modulation frequency, β = fd/fm is defined as the modulation index in which fd is the frequency modulation amplitude. In the case of PM, the modulated phase term is φd sin(2πfmt), and the signal SP M can be expressed as (3) by replacing β with the phase modulation amplitude φd. In applications, the FM signal is usually more precise than the PM signal due to the high frequency resolution of instrumentation. However, the PM signal provides a more stable phase reference, which is essential when applying a multi-frequency MW signal for the DR driving with lock-in detection. Another parameter that is worth discussing is the bandwidth of the modulated MW signal. The bandwidth of the modulated MW signal, i.e., Carson bandwidth, is defined as BWF M = 2(β + 1)fm = 2(fd + fm), and the FM signal can be decomposed as

SF M = A ∞ X Jn(β) sin[2π(f0 + nfm)t],

(4) n=−∞ where Jn(β) is the Bessel function of the first kind. Figure 4(b) plots the maximum scalar factor of the lock-in ODMR spectra for fm = 1 kHz, 5 kHz, 10 kHz, and 20 kHz, and fd ranging from 1 kHz to 20 kHz. The main reason for the decrease of the measured scalar factors with fm is the bandwidth of the diamond magnetometer, as shown in Fig. 3(d). The lines are fitted according to the ODMR spectrum profile. The fitting only agrees well when fd + fm is small but becomes more deficient with increasing fm or fd when BWF M > FWHM. Figure 4(c) plots the measured maximum scalar factors as a function of increasing fm with different φd = β to investigate the parameter dependence when applying the PM MW signal. The Carson bandwidth in the form of phase modulation is BWP M = 2(φd +1)fm. The scalar factor firstly rises because of the equivalent increasing fd = fm·φd. For larger modulation frequencies, the scalar factor is mainly limited by the magnetometer bandwidth. An additional reduction of the signal due to the modulated MW signal bandwidth can be found when fm is close to 20 kHz with β = 4, BWF M = 100 kHz.

The CE-Ramsey scalar factor is also plotted in Fig. 4(c) for different modulation frequencies together with CW-ODMR data. It is compared with another CWODMR measurement, in which the MW signal bandwidth is fixed and fd = BWF M /2–fm is changed according to fm. The applied BWF M is 100 kHz to ensure the modulated fluorescence signal amplitude maximal. As depicted in Fig. 2(c), the signal measured by CE-Ramsey is demodulated with the reference set by the measurement cycle time. Since the effective fluorescence signal is majorly at the beginning of each cycle, the LIA output is small when the demodulation frequency is low and the repolarization time is long. Thus, the scalar factor of the CE-Ramsey measurement is small at the beginning and increases with the demodulation frequency. It also makes the CW-ODMR signal superior to the Ramsey signal with a < 3 kHz demodulation frequency. When the cycle time is 250 μs, corresponding to a demodulation frequency of 4 kHz, the insufficient repolarization starts to reduce the contrast. Nevertheless, the CE-Ramsey scalar factor is still larger than the CW-ODMR over most of the investigated fm. On the other hand, the high noise level from our instrumentation used in experiments at low frequencies, as shown see in Fig. 4(d), allows neither method to achieve the optimal signal-noise ratio. One option for having a better sensitivity could be increasing the laser power to Watts. According to Fig. 2(b), the repolarization time is roughly 10 times shorter when the laser power meets k = 0.26. From Fig. 3(c), the gain of contrast is about 2 times. Taking the two factors into equation (2), the shot-noise limit of the Ramsey measurement will be roughly 6 times lower. However, a Watt laser power could introduce more technical noise that might limit the sensitivity enhancement, which needs to be further investigated.

We investigate the instrument noise floor by measuring the noise spectrum and plotting the calculated root mean square voltages at different demodulation frequencies from 1 kHz to 20 kHz. When the setup operates with the laser switched on and the microwave signal offresonant, the noise reduces by roughly 100 times from 1 kHz to around 10 kHz. However, the measured signals by each method reduce only by a few times, as shown in Fig. 4(c). In each measurement, the magnetometer outputs are acquired with the sampling rate of 900 Hz for 65536 pts, corresponding to a measurement time of 73 s. The minimum detected magnetic field is estimated by dividing the noise floor by the scalar factor. We plot the results calculated with these scalar factors in Fig. 4(d).

It shows that the optimal demodulation frequency is at 9 kHz, with the measured magnetic field noise level at 2 pT. The sensitivity value is 17 pT/√Hz with 1 Hz normalization. The Ramsey measurement runs with 4 MHz Rabi frequency and time interval τm = 6.42 μs. Meanwhile, according to the scalar factors measured in Fig. 4(c), the optimal CW-ODMR signal is only 80% of the signal measured by the CE-Ramsey method, and the optimized noise level plotted in Fig. 4(d) is roughly 2.5 pT.

The values are measured and calculated with MW driving the |0i → |−1i transition, hyperfine lines included. Considering the sensitivity enhancement factor of 1.3 times with DR driving, we expect the CW-ODMR method also measures a 2 pT noise level. In the magnetometer intrinsic noise measurements, batteries are used to generate a bias field with the coils to keep the low magnetic field noise in the shields. However, the continuously discharging of the batteries induces bias-field reduction during the measurements. The continuous shifting of resonant frequency makes it difficult to run the Ramsey measurements, which require on-resonant MW pulses. The CWODMR method is used for characterizing the intrinsic noise of the setup in this work. The bias field drift can be tracked by repeatedly sweeping the MW frequency.

In order to keep the bandwidth of the modulated MW signal approaching the ODMR linewidth in the measurement, the applied parameters are fm = 9 kHz, φd = 2, corresponding to fd = 18 kHz and BW = 54 kHz instead of the 100 kHz bandwidth used in Fig. 3(c).

The resonantly measured magnetic field noise spectrum is depicted in Fig. 5(a). The spectrum is calculated by the Fourier transform of the Lock-in amplifier output, dividing the measured scalar factor and the gyromagnetic ratio. The magnetometer has also been calibrated by applying known fields modulated at 182 kHz [ 32 ]. The magnetic field noise inside the shielding is measured by the OPM (Quspin Inc., QTFM) as a benchmark with the nominally 1 pT/√Hz sensitivity. The diamond magnetometer measures the magnetic field noise with about 10 G bias field generated by the coils powered by batteries.

The coils are turned off when the OPM measures the noise to ensure the required bias field range. The batteries ensure the coil noise lower than the residual magnetic noise inside the shields so that the OPM and the diamond magnetometer could measure the same magnetic field noise floor. The diamond magnetometer measures a noise floor of 2 − 3 pT for frequencies larger than 50 Hz. The noise spectrum at a frequency lower than 50 Hz is partly limited by the residual low-frequency noise inside the shields according to the spectrum measured by the OPM. On the other hand, the discharging of the batteries is another reason for the higher noise level of the diamond magnetometer at low frequencies. The discharging can be removed by a linear fitting so that lowfrequency noises can be well characterized. In Fig. 5(b), the spectra are calculated and plotted with the discharging signal removed, and the low-frequency noise spikes are better characterized than the spectrum in Fig. 5(a).

The off-resonant baseline in Fig. 5(a) marks the intrinsic noise level of the diamond magnetometer, which is 2 − 3 pT from 0 to 200 Hz. The noise level is consistent with the result in Fig. 4(d), which is limited by the optical noise, showing that we measure a shot-noise limited sensitivity of 17 pT/√Hz.

The most efficient way to overcome the sensitivity limit is using a flux concentrator (FC). The increase in sensitivity using a FC follows a simple geometric consideration, i.e., the ratio of the areas at the surface proximal to the diamond vs. the distal surface defines the amplification [ 30 ]. It is also possible to use a ferrite rod with a millimeter diameter as the FG [ 39 ] so that the diamond can measure a remote magnetic field with concentrated flux. Although the ferrite rod cannot have the same flux amplification factor as the FC, the flux concentration is still remarkable due to the high magnetic permeability of the ferrite material, and the FG holds the spatial resolution. Experimentally, the magnetic field generated by the coil is guided and concentrated through the rod to the diamond. The magnetic field signal induced by the battery discharging is detected as the calibration signal.

The discharging signal is a linearly decreasing field, and the slopes measured by magnetometer with and without the FG are compared. With the FG, the magnetic field signal decreases 6.3 times faster than the measurement without FG [ 32 ]. It indicates that the flux is amplified by a factor of 6.3 times. Figure 5(b) shows the comparison of the detected magnetic field noise floor. In order to have a better analysis of the low-frequency magnetic field noise, the discharging signal is removed from the signal traces by linear fitting before performing the Fourier transform. Then, spikes with frequency under 20 Hz can be seen in the spectrum, and the signals come from the cooling fan motors (typically 1000 r/min, roughly 17 Hz) of equipment outside the shields. There is a window on the shielding cube in the direction of the FG. The magnetic noise spikes are guided through the ferrite to the diamond in the measurement. The noise floor translated from the raw output of measurement with the FG is read as 3 pT. With the 6.3 times flux gain, the noise floor measured with the FG is estimated between 0.3 − 0.7 pT over the bandwidth of 20 − 200 Hz, as the spectrum shown in Fig 5(b). The 1 Hz normalized magnetic field noise spectral density is 2.6 − 6 pT/√Hz.

Using the gradiometer configuration described in Fig. 1(b), we measure the diamond magnetic gradient field noise as shown in Fig. 5(c). Respectively, the MW signal is applied to drive the transition of |0i → |+1i in the probe channel (Channel 1) and |0i → |−1i in the reference channel (Channel 2). The probe channel measures a 5 pT noise level due to the driving of a single transition. The noise level of channel 2 is 10 − 20 pT, mainly because of the MW inhomogeneity generated by the loop antenna. Nevertheless, by combing the two signals as the DR driving scheme shown in Fig. 2(a), the gradiometer suppresses the common-mode line shifts, and the differential signal is amplified. Each gradiometer channel can also operate with DR driving, for which four groups of MW frequencies are required. The differential signal can still be amplified by applying MW signals with a 180 deg phase difference regarding the two channels. The differential signal gain could be twice larger because the measurement is operated in two separate diamonds. In principle, the gradient field noise increases by √2 compared to a single-channel magnetometer. Thus, the sensitivity of the differential signal could be √2 times better.

In the measurement with single resonance driving in each channel, the measured gradiometer intrinsic noise is 4 − 6 pT in the frequency range of 20 − 200 Hz and a channel distance of roughly 10 cm.

IV.

CONCLUSIONS AND OUTLOOKS

This work demonstrates the diamond DC magnetometry and gradiometry with low optical pumping power.

Fields ranging from few picotesla to subpicotesla level are measured within a sensing volume of 0.125 mm3. We take advantage of the narrow linewidth and the low noise floor of the lock-in detection technique, using the Ramsey method with low power continuous excitation. With the optimized parameters, the CE-Ramsey measurement can achieve a better sensitivity than CW-ODMR measurement. The real field measurements indicate the intrinsic magnetic noise level of the diamond magnetometer is 2 − 3 pT with a 13.7 mHz frequency resolution, corresponding to a bandwidth normalized noise spectral density of 17 pT/√Hz. The gradiometry measurements also demonstrate a < 10 pT gradient field noise with the same frequency resolution. By applying a ferrite rod with a 2 mm end diameter as the flux guide, the noise level (spectrum calibrated by the battery discharging signal) reduces to 0.3 − 0.7 pT, corresponding to a bandwidth normalized noise spectral density of 2.6 − 6 pT/√Hz. For practical applications, one could combine the FG and the gradiometer to take the combined advantage of the sensitivity, spatial resolution, and magnetic field noise suppression. The FG could be further optimized for higher flux gain by changing its geometry and structure, such as the gap distance between the diamond and the FG end and the structure with a flux return ferrite. Moreover, upon compromising the spatial resolution, the setup could yield a magnetic field sensitivity of a few hundred fT/√Hz at ambient environment if the flux concentrator geometry is used. Techniques such as double quantum magnetometry[ 40 ], spin bath driving, multiple NV orientation driving, and close-loop measurements could advance the sensitivity and stability [ 41, 42 ]. With the demonstrated intrinsic noise level and spatial resolution, diamond magnetometer can be at the forefront for a wide range of sensing applications in classical and quantum technologies.

V.

ACKNOWLEDGMENTS

We thank Axel Griesmaier for offering excellent photodetectors. We also thank Andrej Denisenko, Jianpei Geng, Durga Dasari, and Ayman Mohamed for fruitful discussions. We acknowledge financial support by EU via the project ASTERIQS, and the ERC Advanced Grant No. 742610, SMel, and the BMBF via the project MiLiQuant and the DFG via the GRK 2198 and 2642. In addition, we acknowledge the support of the Japan Society for the Promotion of Science (JSPS) KAKENHI (No. 17H02751). SUPPLEMENTAL INFORMATION: DIAMOND

MAGNETOMETRY AND GRADIOMETRY

TOWARDS SUBPICOTESLA DC FIELD

MEASUREMENT

A. Setup details

The diamond samples are (111)-oriented (0.5 mm)3 cubes obtained by laser-cutting and polishing from a 99.97% 12C enriched single crystal. The crystal is grown by the temperature gradient method at high pressure and high temperature (HPHT) conditions. The initial nitrogen concentration of the original crystal was 1.4 ppm.

After irradiation of 2 MeV electrons and annealing (1000 ◦C for 2 h in vacuum), the NV concentration reached 0.4 ppm.

The laser used in experiments is the low noise version of the Lighthouse Sprout-G 532 nm laser. The homebuilt photodetector is based on the Hamamatsu S359009 model with an active area of 10×10 mm2 and a simple trans-impedance amplifier circuit with a gain of 5.1 kV/A. The detected fluorescence and laser reference signals directly go to the differential input of the lock-in amplifier (LIA, Zurich Instruments, HF2LI). Two vector signal generators (Rohde & Schwarz, SMIQ03B) are used to generate MWs for the double resonance (DR) driving.

In the CW-ODMR measurements, the MWs are mixed with a PM modulated RF signal generated by a function generator (Rigol DG1022), of which the second channel generates a 2.163 MHz signal mixed with the MW by a mixer (Minicircuits, ZAM-42) to drive the hyperfine lines. In the MW pulsed measurements, the MW signal is gated by a data timing generator (Tektronix, DTG5274) and a switch (Minicircuits, ZASWA-50DR+). The combined multi-frequencies MW signal is amplified by the Amplifier-Research 50S1G4 and finally applied to the diamond through the dielectric resonator antenna [ 1 ]. The bias field is generated by the home-built 3D coil system, which are powered by 12 V lead-acid batteries (XCell XP 18-12) in the noise floor measurements to keep the low magnetic field noise level. As illustrated in Fig. 1(a), the setup, including the coils, is installed in the magnetic shields. The shields consist of an inner μ-metal cube and an outer aluminum cube. In the magnetic field measurement with the flux guide, the flux guide is machined from a MN60 ferrite rod, with the permeability of 6500. All the measurements acquire signals with sampling rate of 900 Hz for 65536 pts, corresponding to about 73 s.

B. Operation protocols 1. CW-ODMR operation

The CW-ODMR operation has been described in several publications [ 2–4 ]. In the main text, we have discussed the optimization of operation parameters theoretically and experimentally. In order to determine the scalar factor when using double resonance driving, the two MW signals are swept in opposite directions centering at the two resonant frequencies with the same span.

The slope of the ODMR spectrum derivative at the resonant frequency is denoted as the scalar factor ks. The magnetic field is then measured by applying the two resonant MW frequencies synchronously, and the magnetic field is recorded as the output voltage level dividing the scalar factor.

2. CE-Ramsey operation

The CE-Ramsey operation is based on the sequence described in Fig. 2(b). The magnetic field sensing time τm = 6.42 μs is chosen according to the optimization of max: C(τm)·τm. The repolarization time τr is calculated according to the optimized demodulation frequency as τr = (Tseq − 3π/ΩR)/2 − τm.

(S1) The external magnetic field is measured by continuously running the sequence with the fixed τm. In order to suppress the thermal fluctuation, the double quantum magnetometry can also run with the CE-Ramsey scheme as well by replacing the pulses with double quantum magnetometry pulses [ 5 ]. The scalar factor of the magnetometer is determined by sweeping the MW frequency as the equivalence of magnetic field changes. S-Figure.1 shows the magnetometer response to different MW frequency offsets. Both of the results are measured with single resonance driving for comparison.

S-FIG. 1. Magnetometer response to different MW frequency offsets measured by (a) CW-ODMR method; (b) CE-Ramsey method.

C. Simulation model

the Bloch equations [ 8 ]:

The transition kinetics simulation follows the energy level diagram described in Fig. 1(d). In the diagram, the charge state conversion is neglected because the NV0 has no contribution to the valid signal. Another reason is that the low pumping rate preserves the major population on NV− states [ 6, 7 ], and the continuous excitation maintains the population of NV− and NV0 stable. The simulation includes the green laser excitation, T1 decay, and the MW driving in the single quantum (SQ) basis of the ground state. The MW driving model is based on in which ρij are elements of the density matrix, Γp is the optical pumping rate, Γ1 = 1/T1, Γ2∗ = 1/T2∗ , and Δ is the MW offset frequency. Here we take the |0i → |+1i basis. Combined (S2), (S3) with the optical pumping kinetics, the model is described as: In the model (S4) - (S13), ni is the population of state i, Rij is the transition rate from state i to state j, and Rfl = 66 MHz is the fluorescence emission rate. The intersystem crossing parameters R82 = R83 = 0.7 MHz, R81 = 1 MHz, R57 = R67 = 53 MHz, and the metastable states emission rate R78 = 1000 MHz [ 9, 10 ]. The fluorescence signal is taken as the sum of the excited states population, i.e., n4 + n5 + n6. The background is taken as the signal when the spins are fully initialized without MW.

To calculate the bandwidth of the magnetometer, we applied the modulated MW signal with frequency shift follows Δ(t, fm, fd), where fm is the modulation frequency, fd is the frequency deviation, and the modulation index β = fd/fm = φd.

D. Continuous excitation

One important question is, what would be the cost of using continuous excitation for getting away from the technique noises. Experimentally, Rabi oscillation and the free induced decay (FID) signal are measured with both pulsed laser and continuous laser excitation. Fluorescence signals are all acquired with a 10 μs gate after the MW pulses so that the calculated contrast of the two measurements can be comparative. As shown in S-Fig. 2(a) and (b), the two curves in each figure are hardly distinguishable. The influence of continuous excitation with low laser power can be neglected. In order to investigate the continuous excitation effects on interferometry measurements, we simulate the process with the model described above. According to the Bloch equations (S2) and (S3), the repolarization has an equivalent effect as T1, that it induces Rabi oscillation signal decay in the duration of MW pulse. S-Figure 2(c) plots the simulated (S3) (S4) (S5) (S6) (S7) (S8) (S9) (S10) (S11) (S12) (S13) S-FIG. 2. (a) Rabi oscillations measured by applying continuous laser compared with regular Rabi measurement using pulsed laser. (b) Results comparison of CE-Ramsey measurement and the normal Ramsey measurement. (c) Simulated results of CE-Rabi measurements with increasing optical pumping rate. The pumping rate k = 0.026 corresponds to the laser power of 80 mW. (d) FID signals calculated with CE-Ramsey sequence with increasing optical pumping rate, and ΩR = 2.5 MHz. Rabi oscillations with increasing laser pumping rate. The established Rabi frequency ΩR = 2.5 MHz. The signal decay becomes faster with a higher pumping rate. The average contrast level decreases with increasing pumping rate because the metastable state accumulates more population with appearance of MW when the pumping power is higher. With the presence of Γp and Γ1, the Rabi flopping frequency is given as ΩΓ = r Ω2R − ( Γ4p − Γ21 )2.

(S14) In the experiments, ΩR ≫ Γp, Γ1 , and there would be no significant Rabi frequency change until Γp ≈ ΩR. In any case, the pulse width error in Ramsey sequence can be experimentally compensated. According to (S3), the repolarization does not contribute to the dephasing in the kinetics during the phase integration time interval τm. S-Figure 2(d) shows the simulated FID signal of CERamsey measurements with increasing pumping rate. In the simulation, the MW frequency is on-resonant to the hyperfine line of I = 0, and the oscillation comes from the 14N nuclear spin I = ±1 precessions at ±2.2 MHz. The result shows a faster decay of contrast when the pumping rate is higher. It is because that the repolarization accumulates during τm due to the continuous pumping.

Techniques such as balancing-detector, heat sinking can also be applied for better SNR and stability. While these techniques cannot address all the issues in regular Ramsey measurements, continuous excitation can fix all the problems as mentioned in the main text. For example, a balancing-detector cannot eliminate pulsed readout when the two arms are not well balanced, but by continuous excitation there will be no background fluorescence pulses. Heat sinking cannot prevent the thermal dynamics over the diamond induced by laser pulses, and it could be a reason for linewidth broadening in bulk diamond measurements. Combining continuous excitation with a heat sinking can suppress most of the thermal fluctuations, and thermal stability can be further improved by monitoring the laser power fluctuations. In conclusion, continuous excitation using a low pumping power is beneficial with minimal influence on the signal contrast.

E. Linewidth of the diamond sample

Our diamond sample has NV ensembles with a long dephasing time of T ∗ = 8.5 μs, and it ensures the very

2 narrow ODMR linewidth. The CW-ODMR linewidth of the diamond can be estimated by [ 8 ] Δν = where Γ2 = Γ2∗ + Γc, and Γc is the optical cycles. When both MW power and laser power approach 0, the CWODMR linewidth limit can be estimated as Γ2∗/2π ≈ 19 kHz. The minimum linewidth detected experimentally is 28 kHz. It was difficult to measure the ODMR linewidth with further lower MW power because of the extremely low contrast and the linewidth broadening due to the magnetic field drifts during a long averaging time.

On the other hand, pulsed-ODMR reaches the linewidth limit when the π-pulse time Tπ ≈ T2∗. The linewidth is estimated as Δν = (2√ln 2)/(πT2∗) ≈ 62 kHz [ 8 ], while experimentally the minimum linewidth is 85 kHz. One possible reason for the linewidth broadening is that the π-pulse with a width exceeding T2* becomes inaccurate due to the dephasing limit. In the experiment, we can only estimate the π pulse width by fitting Rabi frequency to the MW power. The contrast becomes poor for further measurements also with longer π-pulse width under lower MW power.

F. Signal contrast 1. CW-ODMR contrast

This section discusses the contrast reduction induced by the MW modulation in magnetic field measurements.

Figure 3(d) in the main text shows that the magnetometer bandwidth has a dependency on the optical pumping rate. MW modulation in the CW-ODMR measurement scheme directly leads to a response of signal contrast following the figure. Experimentally, the MW modulation parameters are fm = 9 kHz, fd = φdfm = 18 kHz. Thus, the differential signal dS(f ) = S(f − fd) − S(f + fd) can be calculated basing on the regular CW-ODMR spectrum measured without MW modulation, and S(f ) is the detected fluorescence signal. The peak-peak amplitude of the differential signal dS(f ) is 17.1 mV, while the peak-peak amplitude of the lock-in detected ODMR signal with the 9 kHz MW modulation is 1.81 mV according to S-Fig. 1(a). Thus, the reduction of the CW-ODMR contrast due to MW modulation can be calculated as 9.45 times. The simulated frequency response of the diamond magnetometer plotted in Fig. 3(d) also indicates a similar contrast reduction.

2. Ramsey contrast

As for the Ramsey measurement, the main reason for the contrast reduction is the insufficient repolarization due to the short cycle time in the lock-in detection, which is discussed in Fig. 2(c). It is shown in the figure that the spins can mostly be repolarized by the 80 mW laser in 0.5 ms. The recovery curve is fitted by Ufl = −Afl exp(−t/τfl), where Afl = 105 mV is the largest fluorescence decrease, τfl = 0.1 ms is the time constant of the signal decay. When the cycle time is short, the spins cannot be fully polarized. In order to estimate the output of the LIA, we can choose a reference time t0 and get ULIA = "Z t0 (S16) where τm = 6.42 μs, is the field measurement time.

With demodulation frequency of 9 kHz, the cycle time Tseq ≈ 110 μs, and the reference time t0 is taken as 150 μs. It can be calculated that ULIA = 6.2 mV. Meanwhile, according to S-Fig. 1(b), the experimentally detected signal U L′IA = U0 − U−1 ≈ 6.3 mV, where U0 and U−1 are the fluorescence signal at the corresponding spin states.

It can be concluded that the insufficient repolarization is the reason for contrast reduction in the Ramsey measurement using lock-in detection. The Ramsey contrast is calculated as Cdet = U L′IA/(U0 + U−1) ≈ 0.17%, where the average fluorescence signal (U0 + U−1)/2 is estimated as 1.88 V according to the experimental data. S-FIG. 3. (a) is the calculated equivalent contrast with different acquisition gating time and the sequence cycle time. (b) is the measured Ceqv in experiments.

3. Continuous readout and gated readout

One of the commonly used methods to increase signal contrast in NV interferometry measurements is to gate the acquired fluorescence pulses. Intuitively, gated acquisition can avoid the invalid fluorescence detection induced by repolarization. In the FID measurements of S-Fig. 2(b), the gated readout is acquired for comparison. Each sequence time is Tseq = 200 μs, and the gating time is 10 μs. The detected signal magnitude is estimated by UG = "Z t0− Ts2eq +Δt When τm = 0, UG = 38.3 mV with the repolarization reference time t0 = 150 μs, and the signal contrast is calculated as UG/U0 ≈ 2%. On the other hand, the gated acquisition deteriorates the sensitivity by a factor of pTseq/Δt (similar sensitivity deterioration happens in Ref. [ 11 ], in which the equation 3 mistakenly writes the factor as pΔt/Tseq). Another factor of pTseq is multiplied for consideration of the magnetic field sampling rate. Therefore, we use the equivalent contrast parameter defined as Ceqv = Cdet/p(Tseq /Δt)/p(Tseq /Tref for the investigation. Ceqv includes both the detected contrast and the sequence parameters to represent the sensitivity changes with different gating time and sequence cycle time. Tref = 110 μs is used for comparison with the fm = 9 kHz measurement in this work. The gated readout is equivalent to the LIA readout when Δt = Tseq/2.

S-Figure 3(a) plots the calculated Ceqv with different gating time and sequence cycle time. The fluorescence reference time is chosen as t0 = Tseq/2 + 50 μs to represent the insufficient repolarization when Tseq is short.

Only with a long enough cycle time, there is an optimized gating time around 0.1 ms. Considering the sensitivity deterioration induced by the sequenced sampling, the acquisition with Δt = Tseq/2 and short Tseq, i.e. the lock-in acquisition used in this work, is superior. The experimentally measured contrast is plotted in S-Fig 3(b).

The measured results show the same trends as the calculation. The difference between the measured and the calculated contrast could result from the minor difference in fluorescence reference time. In conclusion, the results show that the continuous readout with a LIA is superior in sensitivity compared to the gated readout in the case of low laser power pumping.

In order to estimate the shot noise limited sensitivity, the linewidth, contrast, and photon detection rate have to be determined experimentally. According to Fig. 3(c), the measured ν/Cdet with hyperfine lines driving and DR driving is 2.37×106 Hz. To estimate the photon detection rate R, we measured the photodetector output level UP D = 3.76 V with 1 MΩ termination. Thus, the detected photon count rate is

R =

UP D GT I · q where GT I = 5100 is the transimpedence gain, and q is the electron charge. Therefore, the shot noise limited sensitivity can be estimated according to equation (2) as 0.96 pT/Hz1/2. However, this sensitivity limit is calculated without considering the contrast reduction due to the MW modulation. The parameter ν/Cdet should be corrected with a factor depending on the MW modulation frequency. According to the discussion of the CWODMR contrast in Section V F, the factor 9.45 can be considered in the estimation. As a result, the shot noise limited sensitivity of CW-ODMR measurement with 9 kHz MW modulation is estimated as 9.1 pT/Hz1/2.

Experimentally, the scalar factor measured by CWODMR with 9 kHz MW modulation is calculated according to S-FIG. 1(a) as dS/ df · γNV = 2.97 × 10−6 V/nT.

In the slope measurement, only hyperfine lines driving is used in the CW-ODMR measurement in order to make the result comparable to the result of singlequantum Ramsey magnetometry measurement. The measured shot noise level at the demodulation frequency 9 kHz is 7.2 nV in the 73 s measurement, corresponding to 61.5 nV/Hz1/2 with bandwidth normalized to 1 Hz. Thus, the sensitivity can be calculated as 20.7 pT/Hz1/2. Considering the enhancement factor of about 1.3 times with DR driving, we can expect a sensitivity of 15.9 pT/Hz1/2 in CW-ODMR measurements. Since we illuminate the entire diamond with volume of (0.5 mm)3, volume-normalized sensitivity can be calculated as ηv = η√V , which is 5.6 pT/(Hz · mm−3)1/2.

2. Ramsey measurement

As for the CE-Ramsey measurement, the dephasing time T2∗ = 8.5 μs is measured as shown in S-Fig. 2(b).

According to the discussion in Section V F, the Ramsey contrast with sequence time of Tseq ≈ 110 μs is 0.17%. The collected photon per measurement is calculated based on the photon detection rate. Thus, Eq. (1) in the main text is simplified as Ramsey method is calculated according to S-Fig. 1(b), from which the maximum slope is 3.6×10−6 V/nT. With the shot noise demodulated at 9 kHz, the sensitivity of the Ramsey measurement is calculated as 17.1 pT/Hz1/2.

There could be an enhancement if double quantum magnetometry is applied. Then, the sensitivity should be better than the value measured by the CW-ODMR method.

H. Calibration of the magnetometer S-FIG. 4. (a) Output of the magnetometer (2 s cut) with 150 pT modulated test fields of 2 Hz, 5 Hz, and 10 Hz. The modulation frequency is 182 Hz. (b) is the Fourier transform of the acquired signals. The 32 Hz noise spike originates from the 150 Hz harmonics signal with the 182 Hz demodulation.

The magnitude is plotted in logarithm. The noise level is as same as the noise at 182 Hz in Fig.5 of about 2 pT with the frequency resolution of 13.7 mHz.

Since the resonant MW frequencies of the NV spin change with magnetic field following Δf = γΔB, the diamond magnetometer response to the external magnetic field can be acquired by sweeping the applied MW frequency, as shown in S-Fig. 1. From the measured slope kc, the transfer function of the magnetometer can be decided as ΔB = ΔU/γkc, where ΔU is the output voltage of the magnetometer. The intrinsic noise of the magnetometer can then be determined by the Fourier transform of the signal trace. In Fig. 5(a), the calculated spectrum is compared to the output spectrum of the OPM. In the comparison measurement, the diamond and CPC lens structure is replaced by the OPM sensor head. The other parts of the setup, i.e. laser, MW, photodetectors, keep working during the measurement. By this means, we make sure the OPM measures almost the same magnetic field noise as the diamond magnetometer.

The only difference could be the coil noise. The coil noise includes the Johnson noise and the current shot noise of the batteries. The internal resistance of the battery is 16 mΩ when it is fully charged. The resistance of each coil is 49.5 Ω. Thus, the Johnson noise of the coil is estimated as

UJ = q4kT (Rcoil + Rbattery)Δν,

(S20) where k is the Boltzmann constant, T is the temperature taken as 20 ◦C, and Δν = 1 Hz is the measured bandwidth. The current shot noise is calculated according to ishot = p2qIΔν,

(S21) where I is the current in the coil. Since the coil pair generates roughly Bbias = 10−3 T field with U = 12 V power supply, the magnetic field noise generated by the coil pair can be estimated as Bn,coil = √2Bbias pUJ2 + (ishotRcoil)2 ≈ 0.8 pT with 1 Hz band

2U width, and the noise level is 0.01 pT with measurement time of 73 s. Thus, the coil noise can be neglected with the fully charged batteries, and the noise spectral measured by the OPM and the diamond magnetometer are comparable to each other.

Another method to determine the intrinsic noise of the magnetometer is to apply a known oscillating field to the magnetometer and compare the detected signal magnitude at the frequency in the spectrum. One problem of the calibration method is that the known field signal spike could be underestimated because the magnetometer output includes both signal and noise. The underestimation of the signal magnitude leads to the underestimation of the detected noise floor. Thus, calibration measurements with known fields are used as the optional proof of the magnetometer characteristics.

The calibration field has a magnitude of 150 pT, and the frequencies are at 2 Hz, 5 Hz, and 10 Hz. In order to avoid the higher noise level and all the spikes at low frequencies according to the measured noise spectrum in Fig 5, the signals are mixed with a 182 Hz carrier and are generated by an arbitrary waveform generator. The modulated signal is applied to the coils. With the lockin demodulation at 182 Hz we can directly see the lowfrequency signal detected by the diamond magnetometer, as shown in S-Fig. 4(a). In order to suppress the harmonics, the cut-off frequency of the lock-in filter is set at 49 Hz, while the 32 Hz signal can still be seen due to the demodulation of 150 Hz noise spike at 182 Hz. The photodetector output is first translated to magnetic field according to the slope determined by sweeping the MW frequency. The resolved magnetic field magnitudes are 153.3 pT, 152.1 pT, and 136.8 pT at the three frequencies, respectively. The detected 10 Hz signal is significantly lower than 150 pT. The reason could be that the applied mixed signal has 172 Hz and 192 Hz components, and the 192 Hz signal is close to the 200 Hz bandwidth of the magnetometer, which is set by the cut-off frequency of the lock-in filter for demodulating the MW modulation. The magnetometer sensitivity of the three measurements can be calculated with the spectrum according to reference [ 3 ], as 14.5 pT/Hz1/2, 16.7 pT/Hz1/2, and 17.3 pT/Hz1/2. The sensitivity results to a noise level of about 2 pT with the 73 s measurement, which is also the noise level at 182 Hz in the spectrum of Fig. 5.

Modulated MW bandwidth

One of the issues using the diamond with narrow linewidth is that the modulated MW could have a bandwidth similar to the original ODMR linewidth, and the modulated MW would reduce the ODMR contrast and broaden the linewidth. To look into the effect of applying modulated MW with a broaden bandwidth, we use an amplitude modulated (AM) MW signal with only two single-tuned sideband signals at f0 + fm and f0 − fm.

S-Figure 5 plots the ODMR spectrum driven by AM MW signals modulated at different frequencies. The contrast reduces with the increasing fm mainly because of the bandwidth of the magnetometer. There is an apparent splitting of the spectrum which further deteriorates contrast when the MW bandwidth 2fm approaching the HWHM of the ODMR spectrum (from fm = 20 kHz).

As for FM and PM signal, the bandwidth is defined by the Carson bandwidth:

BWF M = 2(fd + fm), BWP M = 2(φd + 1)fm.

(S22) (S23)

In this work, the applied modulation frequency fm = 9 kHz, and φd = 2. The Carson bandwidth BWF M = 54 kHz near the FWHM linewidth. The components amplitude is decomposed by the Bessel function first kind Jn(β), which are 0.224, 0.577, 0.353, 0.129 for n=0 to 3. Thus, most of the MW energy (n=0 to 2) within the NV linewidth makes the parameters reasonable in the measurements.

J. Enhancement with the flux guide

Schematic of the ferrite-diamond geometry is depicted in S-Fig 6(a). The magnetic field generated by the coil is concentrated and guided to the diamond through the FG. The main dynamic of the measured signal is the field change generated by the voltage dynamic of the batteries.

In applications, getting closer to the magnetic field source is the alternation for having higher sensitivity. Thus, applying the flux guide to approach the magnetic field source is equivalent to an improvement of sensitivity.The raw outputs of the diamond magnetometer are plotted as S-FIG. 6(b) shows. The field keeps reducing because the batteries are discharging. Detected field reduction rates are calculated by linear fitting the measured traces.

The detected field reduction rate with FG is 6.3 times faster than the result measured without FG, which means the diamond magnetometer can detect 6.3 times weaker S-FIG. 5. AM signal driving ODMR spectrum detected by LIA with the different amplitude modulation frequencies magnetic source from the other end of the FG than measurement without it. It is equivalent to having the scalar factor improved by 6.3 times. A geometry-based simulation is made by using Finite Element Method Magnetics (FEMM 4.2) software. The length of the ferrite rod and the radius of the coils used in the model is 16 cm, approximating the experimental parameters. The flux enhancement at the diamond center is calculated to be roughly 14 times. The reason for the enhancement difference to the experimental result could be errors in the primary parameters shown in S-Fig. 6(a) which can affect the flux concentration a lot. In order to quantify the collection of extra noises by the flux guide, the discharging field is removed before we do Fourier transform to the signal traces.

S-Figure 6(c) is the original noise floor spectra, and (d) is the spectra after removing the discharging signals from the outputs. The low-frequency noise spikes can be seen after removing the batteries discharging signal, and they are generated by the cooling fan motors of instruments in the laboratory. The low-frequency spikes are hard to be seen by the magnetometer in the shields. However, after applying the FG, of which one end is close to a window of the shields, the noise spikes are guided by the FG and detected by the magnetometer. It can also be seen that more noises are collected by the ferrite rod, and the noise floor is lifted to 3 – 4 pT at the location of the diamond.

Taking the signal gain induced by the ferrite rod into consideration, the SNR of the probe is improved, and the minimum detectable field is estimated as 0.3 - 0.7 pT according to the spectrum in Fig 5(b).

The flux guide is not optimal in this experiment. For example, the gap between the ferrite and the diamond is as large as 1 mm, and it makes the flux gain much lower than using a smaller gap. A flux return ferrite could give a higher gain for flux concentration and better field homogeneity. We simulate a flux guide schematic as S-FIG. 7 draws, and it is a planar simulation. A magnet dipole is placed 75 mm away from the diamond and the field at location of the diamond without FG is 0.055 G. The difference of the field distribution within the diamond is 0.4% according to the calculation. When the flux guide is applied, the flux is concentrated and guided, and the magnetometer detects a field of 3.86 G, with the field difference also around 0.4%. Thus, the field is amplified by 70 times in the planar simulation, while the spatial resolution is kept around 1 mm according to the distal diameter of the FG. S-FIG. 6. (a) Key dimensions of the ferrite-diamond geometry used in experiment and flux simulation. The unit is in mm. (b) The magnetic field changes detected by diamond magnetometer with and without the FG. The ramping signals are induced by discharging of the batteries, and are used for calibration by linear fitting the discharging rate. (c) The magnetic field noise spectra analyzed from the raw output of the magnetometer. (d) The magnetic field noise spectra analyzed from the signal with the discharging field signals removed. The low-frequency noise spikes appear after the filtering.

[1] J. C. Allred , R. N. Lyman , T. W. Kornack , and M. V. Romalis , High-sensitivity atomic magnetometer unaffected by spin-exchange relaxation , Phys. Rev. Lett . 89 , 130801 ( 2002 ). [2] J. M. Taylor , P. Cappellaro, L. Childress , L. Jiang , D. Budker , P. R. Hemmer , A. Yacoby , R. Walsworth , and M. D. Lukin , High-sensitivity diamond magnetometer with nanoscale resolution , Nat. Phys . 4 , 810 ( 2008 ). [3] H. B. Dang , A. C. Maloof , and M. V. Romalis , Ultrahigh sensitivity magnetic field and magnetization measurements with an atomic magnetometer , Appl. Phys. Lett . 97 , 151110 ( 2010 ). [4] T. Wolf , P. Neumann , K. Nakamura , H. Sumiya , T. Ohshima , J. Isoya , and J. Wrachtrup , Subpicotesla diamond magnetometry , Phys. Rev. X 5 , 041001 ( 2015 ). [5] N. Hinkley , J. A. Sherman , N. B. Phillips , M. Schioppo , N. D. Lemke , K. Beloy , M. Pizzocaro , C. W. Oates , and A. D. Ludlow , An atomic clock with 10-18 instability , Science 341 , 1215 ( 2013 ). [6] J. Michl , J. Steiner , A. Denisenko , A . Bu¨lau, A. Zimmermann , K. Nakamura , H. Sumiya , S. Onoda , P. Neumann , J. Isoya , and J. Wrachtrup , Robust and accurate electric field sensing with solid state spin ensembles , Nano Lett . 19 , 4904 ( 2019 ). [7] P. Neumann , I. Jakobi , F. Dolde , C. Burk , R. Reuter , G. Waldherr, J. Honert , T. Wolf , A. Brunner , and J. H. Shim , High-precision nanoscale temperature sensing using single defects in diamond , Nano Lett . 13 , 2738 ( 2013 ). [8] T. W. Kornack , R. K. Ghosh , and M. V. Romalis , Nuclear spin gyroscope based on an atomic comagnetometer , Phys. Rev. Lett . 95 , 230801 ( 2005 ). [9] C. Zhang , H. Yuan , Z. Tang , W. Quan , and J. C. Fang , Inertial rotation measurement with atomic spins: From angular momentum conservation to quantum phase theory , Appl. Phys. Rev . 3 , 041305 ( 2016 ). [10] X. Wu , Z. Pagel , B. S. Malek , T. H. Nguyen , F. Zi , D. S. Scheirer , and H. Mu ¨ller, Gravity surveys using a mobile atom interferometer , Sci. Adv . 5 , eaax0800 ( 2019 ). [11] C. Gemmel , W. Heil , S. Karpuk , K. Lenz , Y. Sobolev , K. Tullney , M. Burghoff , W. Kilian , S. KnappeGru¨neberg, W. Mu¨ller, A. Schnabel , F. Seifert , L. Trahms , and U. Schmidt , Limit on lorentz and cpt violation of the bound neutron using a free precession 3he/129xe comagnetometer , Phys. Rev. D 82 , 111901(R) ( 2010 ). [12] M. Pospelov , S. Pustelny , M. P. Ledbetter , D. F. Jackson Kimball , W. Gawlik , and D. Budker , Detecting domain walls of axionlike models using terrestrial experiments , Phys. Rev. Lett . 110 , 021803 ( 2013 ). [13] E. Boto , N. Holmes , J. Leggett , G. Roberts , V. Shah , S. S. Meyer, L. D. Mun˜oz, K. J. Mullinger , T. M. Tierney , and S. Bestmann , Moving magnetoencephalography towards real-world applications with a wearable system , Nature 555 , 657 ( 2018 ). [14] R. Zhang , W. Xiao, Y. Ding , Y. Feng , X. Peng , L. Shen , C. Sun , T. Wu, Y. Wu , and Y. Yang , Recording brain activities in unshielded earth's field with optically pumped atomic magnetometers , Sci. Adv . 6 , eaba8792 ( 2020 ). [15] P. Maletinsky , S. Hong , M. S. Grinolds , B. Hausmann , M. D. Lukin , R. L. Walsworth , M. Loncar , and A. Yacoby , A robust scanning diamond sensor for nanoscale imaging with single nitrogen-vacancy centres , Nat. Nano . 7 , 320 ( 2012 ). [16] E. Schaefer-Nolte , F. Reinhard , M. Ternes , J. Wrachtrup , and K. Kern , A diamond-based scanning probe spin sensor operating at low temperature in ultra-high vacuum , Rev. Sci. Instrum . 85 , 013701 ( 2014 ). [17] Q. Sun , T. Song , E. Anderson, T. Shalomayeva , J. F¨orster, A . Brunner, T. Taniguchi , K. Watanabe , J. Gr¨afe, R. St¨ohr, X. Xu , and J. Wrachtrup , Magnetic domains and domain wall pinning in two-dimensional ferromagnets revealed by nanoscale imaging , Nat. Commun . 12 , 1989 ( 2021 ). [18] S. Gudoshnikov , N. Usov , A. Nozdrin , M. Ipatov , A. Zhukov , and V. Zhukova , Highly sensitive magnetometer based on the off-diagonal gmi effect in co-rich glasscoated microwire , Phys. Status Solidi A 211 , 980 ( 2014 ). [19] B. Patton , E. Zhivun , D. C. Hovde , and D. Budker , Alloptical vector atomic magnetometer , Phys. Rev. Lett . 113 , 013001 ( 2014 ). [20] A. Grosz , V. Mor , S. Amrusi , I. Faivinov , E. Paperno, and L. Klein , A high-resolution planar hall effect magnetometer for ultra-low frequencies , IEEE Sens. J 16 , 3224 ( 2016 ). [21] V. Shah , S. Knappe , P. D. D. Schwindt , and J. Kitching , Subpicotesla atomic magnetometry with a microfabricated vapour cell , Nat. Photonics 1 , 649 ( 2007 ). [22] C. S. Shin , C. E. Avalos , M. C. Butler , D. R. Trease , S. J. Seltzer , J. P. Mustonen , D. J. Kennedy , V. M. Acosta , D. Budker , and A. Pines , Room-temperature operation of a radiofrequency diamond magnetometer near the shotnoise limit , J. Appl. Phys . 112 , 124519 ( 2012 ). [23] D. Drung , High-performance dc squid read-out electronics , Physica C 368 , 134 ( 2002 ). [24] I. K. Kominis , T. W. Kornack , J. C. Allred , and M. V. Romalis , A subfemtotesla multichannel atomic magnetometer , Nature 422 , 596 ( 2003 ). [25] W. C. Griffith , S. Knappe , and J. Kitching , Femtotesla atomic magnetometry in a microfabricated vapor cell , Opt. Exp . 18 , 27167 ( 2010 ). [26] P. F. Wang , Z. H. Yuan , P. Huang , X. Rong , M. Q. Wang , X. K. Xu , C. Duan , C. Y. Ju , F. Z. Shi , and J. F. Du , High-resolution vector microwave magnetometry based on solid-state spins in diamond , Nat. Commun . 6 , 6631 ( 2015 ). [27] H. Zheng , J. Xu , G. Z. Iwata , T. Lenz , J. Michl , B. Yavkin , K. Nakamura , H. Sumiya , T. Ohshima , J. Isoya , J. Wrachtrup , A. Wickenbrock , and D. Budker , Zero-field magnetometry based on nitrogen-vacancy ensembles in diamond , Phys. Rev. Appl . 11 , 064068 ( 2019 ). [28] A. Wickenbrock , H. Zheng , L. Bougas , N. Leefer , S. Afach , A. Jarmola , V. M. Acosta , and D. Budker , Microwave-free magnetometry with nitrogen-vacancy centers in diamond , Appl. Phys. Lett . 109 , 053505 ( 2016 ). [29] J. F. Barry , M. J. Turner , J. M. Schloss , D. R. Glenn , Y. Song , M. D. Lukin , H. Park , and R. L. Walsworth , Optical magnetic detection of single-neuron action potentials using quantum defects in diamond , Proc. Nat. Acad. Sci. USA , 201601513 ( 2016 ). [30] I. Fescenko , A. Jarmola , I. Savukov , P. Kehayias , J. Smits , J. Damron , N. Ristoff , N. Mosavian , and V. M. Acosta , Diamond magnetometer enhanced by ferrite flux concentrators , Phys. Rev. Res . 2 , 023394 ( 2020 ). [31] J. F. Barry , J. M. Schloss , E. Bauch, M. J. Turner , C. A. Hart , L. M. Pham , and R. L. Walsworth , Sensitivity optimization for nv-diamond magnetometry , Rev. Mod. Phys . 92 , 015004 ( 2020 ). [32] See Supplemental Material at [URL] for details of the simulation model, continuous excitation, magnetometer calibration, and flux guide aided magnetometry . [33] P. Kapitanova , V. V. Soshenko , V. V. Vorobyov , D. Dobrykh , S. V. Bolshedvorskii , V. N. Sorokin , and A. V. Akimov , 3d uniform manipulation of nv centers in diamond using a dielectric resonator antenna , JETP Lett . 108 , 588 ( 2018 ). [34] A. Dr ´eau, M. Lesik , L. Rondin , P. Spinicelli , O. Arcizet , J. F. Roch , and V. Jacques , Avoiding power broadening in optically detected magnetic resonance of single nv defects for enhanced dc magnetic field sensitivity , Phys. Rev. B 84 , 195204 ( 2011 ). [35] J. P. Tetienne , L. Rondin , P. Spinicelli , M. Chipaux , T. Debuisschert , J. F. Roch , and V. Jacques , Magneticfield-dependent photodynamics of single nv defects in diamond: an application to qualitative all-optical magnetic imaging , New J. Phys . 14 , 103033 ( 2012 ). [36] Y. Dumeige , J.-F. Roch , F. Bretenaker , T. Debuisschert , V. Acosta , C. Becher , G. Chatzidrosos , A. Wickenbrock , L. Bougas , and A. Wilzewski , Infrared laser threshold magnetometry with a nv doped diamond intracavity etalon , Opt. Exp . 27 , 1706 ( 2019 ). [37] I. Meirzada , Y. Hovav , S. A. Wolf , and N. Bar-Gill , Negative charge enhancement of near-surface nitrogen vacancy centers by multicolor excitation , Phys. Rev. B 98 , 245411 ( 2018 ). [38] N. Aslam , G. Waldherr, P. Neumann , F. Jelezko , and J. Wrachtrup , Photo-induced ionization dynamics of the nitrogen vacancy defect in diamond investigated by single-shot charge state detection , New J. Phys . 15 , 013064 ( 2013 ). [39] W. C. Griffith , R. Jimenez-Martinez , V. Shah , S. Knaappe , and J. Kitching , Miniature atomic magnetometer integrated with flux concentrators , Appl. Phys. Lett . 94 , 023502 ( 2009 ). [40] K. Fang , V. M. Acosta , C. Santori , Z. Huang , K. M. Itoh , H. Watanabe , S. Shikata , and R. G. Beausoleil , Highsensitivity magnetometry based on quantum beats in diamond nitrogen-vacancy centers , Phys. Rev. Lett . 110 , 130802 ( 2013 ). [41] E. Bauch , C. A. Hart , J. M. Schloss , M. J. Turner , J. F. Barry , P. Kehayias , S. Singh , and R. L. Walsworth , Ultralong dephasing times in solid-state spin ensembles via quantum control , Phys. Rev. X 8 , 031025 ( 2018 ). [42] C. Zhang , H. Yuan , N. Zhang , L. Xu, J. Zhang , B. Li , and J. Fang , Vector magnetometer based on synchronous manipulation of nitrogen-vacancy centers in all crystal directions , J. Phys. D 51 , 155102 ( 2018 ).