Light field image processing method for depth acquisition
11438571 · 2022-09-06
Assignee
Inventors
Cpc classification
H04N13/232
ELECTRICITY
H04N13/271
ELECTRICITY
International classification
Abstract
Techniques for capturing three-dimensional image data of a scene and processing light field image data obtained by an optical wavefront sensor in 3D imaging applications are provided. The disclosed techniques provide a depth map of an observable scene from light field information about an optical wavefront emanating from the scene, and make use of color filters forming a color mosaic defining a primary color and one or more secondary colors, and color radial transfer functions calibrated to provide object distance information from the spatio-spectrally sampled pixel data.
Claims
1. A method of imaging a scene, the method comprising: diffracting an optical wavefront originating from the scene to generate a diffracted optical wavefront; detecting the diffracted optical wavefront in a near-field regime using a pixel array comprising a plurality of light-sensitive pixels each associated with a color filter, thereby obtaining pixel data, the color filters forming a color mosaic defining a primary color and one or more secondary colors arranged such that different neighboring pixels associated with the primary color detect different spatial parts of the diffracted optical wavefront over a full cycle of the diffracted optical wavefront; parsing the pixel data according to the primary and secondary colors into corresponding primary and secondary color channels; determining a main base component and a main modulating component of the diffracted optical wavefront associated with the primary color channel; determining, based on the main modulating component, a secondary base component of the diffracted optical wavefront associated with each secondary channel; reconstructing a 2D image of the scene based on the main and secondary base components; and creating a depth map of the scene based on at least one color radial transfer function calibrated to provide object distance information from the modulating component of an associated one of the color channels.
2. The method of claim 1, wherein the primary color channel comprises a green color channel, and the one or more secondary color channels comprise a red color channel and a blue color channel.
3. The method of claim 2, wherein the color mosaic is a Bayer pattern.
4. The method of claim 1, wherein determining the main base component and the main modulating component comprises summing and subtracting pixel data from neighboring pixel banks associated with the primary color, respectively.
5. The method of claim 1, wherein determining the secondary base component associated with each secondary color channel comprises: determining a secondary modulating component associated with the secondary color channel based on the main base component and the main modulating component; and determining the secondary base component based on the secondary modulating component.
6. The method of claim 1, wherein creating the depth map comprises creating a coarse depth map based on the main modulating component and the associated color radial transfer function.
7. The method according to claim 6, wherein creating the coarse depth map comprises: determining relative depth information from the main modulating component; and determining absolute depth information from a comparison of the relative depth information to the color radial transfer function associated with the primary color channel.
8. The method of claim 6, wherein creating the depth map comprises adjusting the coarse depth map based on the one or more secondary modulating components and the associated one or more color radial transfer functions.
9. The method of claim 1, further comprising providing the diffraction grating with a grating period that is greater than a pixel pitch of the pixel array.
10. The method of claim 9, wherein the grating period is twice the pixel pitch.
11. A non-transitory computer readable storage medium having stored thereon computer executable instructions for generating three-dimensional image data of a scene from a diffracted optical wavefront originating from the scene and having been detected in a near-field regime using a pixel array comprising a plurality of light-sensitive pixels each associated with a color filter, thereby obtaining pixel data, the color filters forming a color mosaic defining a primary color and one or more secondary colors arranged such that different neighboring pixels associated with the primary color detect different spatial parts of the diffracted optical wavefront over a full cycle of the diffracted optical wavefront, the computer executable instructions, when executed by a processor having received the pixel data, cause the processor to perform the following steps: parsing the pixel data according to the primary and secondary colors into corresponding primary and secondary color channels; determining a main base component and a main modulating component of the diffracted optical wavefront associated with the primary color channel; determining, based on the main modulating component, a secondary base component of the diffracted wavefront associated with each secondary channel; reconstructing a 2D image of the scene based on the main and secondary base components; and creating a depth map of the scene based on at least one color radial transfer function calibrated to provide object distance information from the modulating component of an associated one of the color channels.
12. The non-transitory computer readable storage medium of claim 11, wherein the primary color channel comprises a green color channel, and the one or more secondary color channels comprise a red color channel and a blue color channel.
13. The non-transitory computer readable storage medium of claim 12, wherein the color mosaic is a Bayer pattern.
14. The non-transitory computer readable storage medium of claim 11, wherein determining the main base component and the main modulating component comprises summing and subtracting pixel data from neighboring pixel banks associated with the primary color, respectively.
15. The non-transitory computer readable storage medium of claim 11, wherein determining the secondary base component associated with each secondary color channel comprises: determining a secondary modulating component associated with the secondary color channel based on the main base component and the main modulating component; and determining the secondary base component based on the secondary modulating component.
16. The non-transitory computer readable storage medium of claim 11, wherein creating the depth map comprises creating a coarse depth map based on the main modulating component and the associated color radial transfer function.
17. The non-transitory computer readable storage medium according to claim 16, wherein creating the coarse depth map comprises: determining relative depth information from the main modulating component; and determining absolute depth information from a comparison of the relative depth information to the color radial transfer function associated with the primary color channel.
18. The non-transitory computer readable storage medium of claim 16, wherein creating the depth map comprises adjusting the coarse depth map based on the one or more secondary modulating components and the associated one or more color radial transfer functions.
19. The non-transitory computer readable storage medium of claim 11, wherein the diffraction grating has a grating period that is greater than a pixel pitch of the pixel array.
20. The non-transitory computer readable storage medium of claim 19, wherein the grating period is twice the pixel pitch.
21. A system for imaging a scene, the system comprising: a diffracting grating configured to diffract an optical wavefront originating from the scene to generate a diffracted optical wavefront; a pixel array comprising a plurality of light-sensitive pixels configured to detect the diffracted optical wavefront in a near-field regime, thereby obtaining pixel data; a color filter array interposed between the diffraction grating and the pixel array, the color filter array comprising a plurality of color filters each associated with a corresponding one of the plurality of light-sensitive pixels and forming a color mosaic defining a primary color and one or more secondary colors arranged such that different neighboring pixels associated with the primary color detect different spatial parts of the diffracted optical wavefront over a full cycle of the diffracted optical wavefront; and a processor configured to: receive the pixel data from the pixel array; parse the pixel data according to the primary and secondary colors into corresponding primary and secondary color channels; determine a main base component and a main modulating component of the diffracted optical wavefront associated with the primary color channel; determine, based on the main modulating component, a secondary base component of the diffracted optical wavefront associated with each secondary channel; reconstruct a 2D image of the scene based on the main and secondary base components; and create a depth map of the scene using at least one color radial transfer function calibrated to provide object distance information from the modulating component of an associated one of the color channels.
22. The system of claim 21, wherein the diffraction grating comprises a transmissive binary phase grating.
23. The system of claim 21, wherein the diffraction grating has a grating period, and wherein the pixel array has a pixel pitch that is smaller than grating period.
24. The system of claim 21, wherein the color filters comprises red filters, green filters, and blue filters, wherein the color mosaic is a Bayer pattern, and wherein the primary color channel comprises a green channel, and the one or more secondary color channels comprise a red channel and a blue channel.
25. The system of claim 21, wherein the processor is configured to determine the main base component and the main modulating component by summing and subtracting pixel data from neighboring pixel banks associated with the primary color, respectively.
Description
BRIEF DESCRIPTION OF THE DRAWINGS
(1)
(2)
(3)
(4)
(5)
(6)
(7)
(8)
(9)
(10)
(11)
(12)
(13)
(14)
DETAILED DESCRIPTION
(15) In the present description, similar features in the drawings have been given similar reference numerals, and, to not unduly encumber the figures, some elements may not be indicated on some figures if they were already identified in a preceding figure. It should also be understood that the elements of the drawings are not necessarily depicted to scale, since emphasis is placed upon clearly illustrating the elements and structures of the present embodiments.
(16) In the present description, and unless stated otherwise, the terms “connected” and “coupled”, as well as variants and derivatives thereof, refer to any connection or coupling, either direct or indirect, between two or more elements. The connection or coupling between the elements may be mechanical, optical, electrical, operational or a combination thereof. It will also be appreciated that positional descriptors and other like terms indicating the position or orientation of one element with respect to another element are used herein for ease and clarity of description and should, unless otherwise indicated, be taken in the context of the figures and should not be considered limiting. It will be understood that such spatially relative terms are intended to encompass different orientations in use or operation of the present embodiments, in addition to the orientations exemplified in the figures. More particularly, it is to be noted that in the present description, the terms “over” and “under” in specifying the relative spatial relationship of two elements denote that the two elements can be either in direct contact with each other or separated from each other by one or more intervening elements. In the present description, the terms “a”, “an” and “one” are defined to mean “at least one”, that is, these terms do not exclude a plural number of items, unless specifically stated otherwise.
(17) The present description generally relates to techniques for capturing three-dimensional image data of a scene and processing light field image data obtained by an optical wavefront sensor in 3D imaging applications. More particularly, the present description discloses techniques for generating or building a 3D or depth image or a combined 2D image and depth map of an observable scene from light field information about an optical wavefront emanating from the scene.
(18) In the present description, the term “providing” is used broadly and refers to, but is not limited to, making available for use, acquiring, obtaining, accessing, supplying, receiving, assigning and retrieving. By way of example, in some implementations, the provision of the light field image data to be processed can involve the act of directly acquiring the light field image data using a light field capture device and making available the light field image data thus acquired. However, in other implementations, the provision of the light field image data can involve the act of retrieving or receiving previously acquired light field image data, for example from a database or a storage medium.
(19) In some implementations, the present techniques involve the specific manipulation and comparison of the chromatic dependence of diffraction by means of one or more diffractive optical elements paired with an appropriate chromatic encoding mechanism, as well as its use in 3D imaging. In some implementations, the light field image data to be processed into a 3D image forms an array of image data pixels representing a spatio-spectral distribution of a light field after diffraction by a diffraction grating and spectral filtering according to a mosaic color pattern.
(20) In some implementations, the techniques disclosed herein are sensitive to not only the intensity of an optical wavefront originating from an observable scene, but also the wavelength, through a specific spatio-spectral subsampling of a generated interference or diffraction grating pattern, allowing for direct measurement of the chromatic dependence of diffraction, the angle of incidence, the phase, and the polarization of the optical wavefront. Therefore, light field imaging devices, for example depth cameras, can acquire more information than traditional cameras, which typically record only light intensity. The raw image data captured by light field capture devices can be used or processed in a variety of ways to provide multiple functions including, but not limited to, 3D depth map extraction, 3D surface reconstruction, image refocusing, and the like. Depending on the application, the light field image data of an observable scene can be acquired as one or more still images or as a video stream. That is, each individual captured frame generally contains all relevant information to generate an individual light field scene. These frames, and their subsequent light field scenes, can be combined and played sequentially to act like a video stream.
(21) The present techniques can be used in imaging applications that require or can benefit from enhanced depth sensing and other 3D imaging capabilities, for example to allow a user to change the focus, the point of view and/or the depth of field of a captured image of a scene. Non-limiting typical observable scene examples could include: a person taking a selfie using their front facing camera on a mobile phone, a car approaching an intersection with pedestrians crossing and a stop sign, a car trying to park in a tight parking spot, an individual's hands gesturing to interact with a virtual or augmented reality scene, and the like.
(22) The present techniques can be applied to or implemented in various types of 3D imaging systems and methods including, without limitation, light field imaging applications using plenoptic descriptions, ranging applications through the comparative analysis of the chromatic dependence of diffraction, and single-sensor single-image depth acquisition applications. Non-exhaustive advantages and benefits of certain implementations of the present techniques can include: compatibility with passive sensing modalities that employ less power to perform their functions; compatibility with single-sensor architectures having reduced footprint; enablement of depth mapping functions while preserving 2D performance; simple and low-cost integration into existing image sensor hardware and manufacturing processes; compatibility with conventional CMOS and CCD image sensors; and elimination of the need for multiple components, such as dual cameras or cameras equipped with active lighting systems for depth detection.
(23) In the present description, the terms “light” and “optical” are used to refer to radiation in any appropriate region of the electromagnetic spectrum. More particularly, the terms “light” and “optical” are not limited to visible light, but can also include invisible regions of the electromagnetic spectrum including, without limitation, the terahertz (THz), infrared (IR) and ultraviolet (UV) spectral bands. In some implementations, the terms “light” and “optical” can encompass electromagnetic radiation having a wavelength ranging from about 175 nanometers (nm) in the deep ultraviolet to about 300 micrometers (μm) in the terahertz range, for example from about 400 nm at the blue end of the visible spectrum to about 1550 nm at telecommunication wavelengths, or between about 400 nm and about 650 nm to match the spectral range of typical red-green-blue (RGB) color filters. Those skilled in the art will understand, however, that these wavelength ranges are provided for illustrative purposes only and that the present techniques may operate beyond these ranges.
(24) In the present description, the terms “color” and “chromatic”, and variants and derivatives thereof, are used not only in their usual context of human perception of visible electromagnetic radiation (e.g., red, green and blue), but also, and more broadly, to describe spectral characteristics (e.g., diffraction, transmission, reflection, dispersion, absorption) over any appropriate region of the electromagnetic spectrum. In this context, and unless otherwise specified, the terms “color” and “chromatic” and their derivatives can be used interchangeably with the term “spectral” and its derivatives.
(25) The present techniques can be used to process raw light field image data captured with various types of light field capture devices. Non-limiting examples of such devices are described below and illustrated in
(26) According to another aspect of the invention, there is provided a computer readable memory storing computer executable instructions thereon that, when executed by a computer, can perform various steps of the image processing method disclosed herein.
(27) As used herein, the term “computer readable memory” is intended to refer to a non-transitory and tangible computer product that can store and communicate executable instructions for the implementation of various steps of the image processing method disclosed herein. The computer readable memory can be any computer data storage device or assembly of such devices including, for example: a temporary storage unit such as a random-access memory (RAM) or dynamic RAM; a permanent storage such as a hard disk; an optical storage device, such as a CD or DVD (rewritable or write once/read only); a flash memory; and/or other non-transitory memory technologies. A plurality of such storage devices may be provided, as can be understood by those skilled in the art. The computer readable memory may be associated with, coupled to or included in a computer configured to execute instructions stored in the computer readable memory in connection with various functions associated with the computer.
(28) As used herein, the term “computer” refers broadly to any computing or processing unit or device including electronic circuitry that can control and execute, at least partly, instructions required to perform various steps of the image processing method disclosed herein. The computer can be embodied by a general-purpose computer, a central processing unit (CPU), a microprocessor, a microcontroller, a processing core, or any other processing resource or any combination of such computer or processing resources configured to operate collectively as a processing unit. For example, in some implementations, the processing unit implementing the image processing method described herein can be an image signal processor (ISP) or a digital signal processor (DSP) running on a system on a chip (SoC), a graphics processing unit (GPU), a field-programmable gate array (FPGA), or a combination thereof.
(29) It will be readily understood that in some implementations, all of the steps of the processing method disclosed herein may be accomplished by a single processor, whereas in other implementations one or more of the steps of such a method may be performed on different processors or at different physical locations. Furthermore, while in some implementations the processing of the image data may be performed by a same device also performing the acquisition of the light field, for example a phone or tablet, in other implementation the light field image data may be transmitted to a different location or device and processed separately.
(30) In some implementations, there is provided a method of processing or otherwise operating upon light field image data representing a discretized sampling of a continuous optical wavefront incident from an observable scene. The light field image data can be provided as an array of image data pixels representing a spatial distribution of the optical wavefront after diffraction by a diffraction grating assembly and, optionally, subsequently spectral filtering according to a mosaic color pattern mapped to the array of image data pixels.
(31) Referring to
(32) The light field capture device 20 includes a diffraction grating assembly or structure 24 configured to receive an optical wavefront 26 originating from the scene 22. The diffraction grating assembly 24 can include at least one diffraction grating 28, each of which having a grating axis 30 and a diffraction grating pattern or refractive index modulation pattern 32 having a grating period 34 along the grating axis 30. In
(33) Referring still to
(34) Depending on the application or use, the light field capture device can be implemented using various image sensor architectures and pixel array configurations. For example, the light field capture device can be implemented simply by adding or coupling a diffraction grating assembly on top of an already existing image sensor including a pixel array and a color filter array. For example, the existing image sensor can be a conventional 2D CMOS or CCD imager. However, in other implementations, the light field capture device can be implemented and integrally packaged as a separate, dedicated and/or custom-designed device incorporating all or most of its components (e.g., diffraction grating assembly, pixel array, color filter array).
(35) For example, referring to
(36) In frontside illumination technology, the pixel array circuitry 86 includes an array of metal wiring (e.g., a silicon layer hosting a plurality of metal interconnect layers) connecting the color filters 44 to their corresponding light-sensitive pixels 40. Meanwhile, backside illumination technology provides opportunities for directly sampling the diffracted wavefront 36 produced by diffraction of an optical waveform 26 by the diffraction grating assembly 24. As light does not have to pass through the array of metal wiring of the pixel array circuitry 86 before reaching the pixel array 38, which otherwise would result in a loss of light, more aggressive diffraction grating designs with increased periodicity can be implemented. Also, the shorter optical stack configuration, as shown in
(37) More detail regarding the structure, configuration and operation of the components introduced in the preceding paragraphs as well as other possible components of the light field capture device will be described below.
(38) Returning to
(39) Diffraction occurs when a wavefront, whether electromagnetic or otherwise, encounters a physical object or a refractive-index perturbation. The wavefront tends to bend around the edges of the object. Should a wavefront encounter multiple objects, whether periodic or otherwise, the corresponding wavelets may interfere some distance away from the initial encounter as demonstrated by Young's double slit experiment. This interference creates a distinct pattern, referred to as a “diffraction grating pattern” or “diffraction pattern” or “interference pattern”, as a function of distance from the original encounter, which is sensitive to the incidence angle and the spectral content of the wavefront, and the general size, shape, and relative spatial relationships of the encountered objects. This interference can be described through the evolving relative front of each corresponding wavelet, as described by the Huygens-Fresnel principle.
(40) In the present description, the term “diffraction grating”, or simply “grating”, generally refers to a periodic structure having periodically modulated optical properties (e.g., a refractive index modulation pattern) that spatially modulates the amplitude and/or the phase of an optical wavefront incident upon it. The diffraction grating may also be referred to as a Transmission Diffraction Mask (TDM). A diffraction grating can include a periodic arrangement of diffracting elements (e.g., alternating ridges and grooves) whose spatial period—the grating period—is nearly equal to or slightly longer than (e.g., up to a few times longer than) the wavelength of light. An optical wavefront containing a range of wavelengths incident on a diffraction grating will, upon diffraction, have its amplitude and/or phase modified, and, as a result, a space- and time-dependent diffracted wavefront is produced. In general, a diffracting grating is spectrally dispersive so that each wavelength of an input optical wavefront will be outputted along a different direction. However, diffraction gratings exhibiting a substantially achromatic response over an operating spectral range exist and can be used in some implementations. For example, in some implementations, the diffraction grating can be achromatic in the spectral range of interest and be designed for the center wavelength of the spectral range of interest. More particularly, in the case of a Bayer patterned color filter array, the diffraction grating can be optimized for the green channel, that is, around a center wavelength of about 532 nm. It is to be noted that when the diffraction grating is achromatic, it is the mosaic color pattern of the color filter array that provides the chromatic sub-sampling of the diffraction pattern of the diffracted wavefront.
(41) Depending on whether the diffracting elements forming the diffraction grating are transmitting or reflective, the diffraction grating will be referred to as a “transmission grating” or a “reflection grating”. The diffracting gratings described herein are transmission gratings, although the use of reflection gratings is not excluded a priori. Diffraction gratings can also be classified as “amplitude gratings” or “phase gratings”, depending on the nature of diffracting elements. In amplitude gratings, the perturbations to the initial wavefront caused by the grating are the result of a direct amplitude modulation, while in phase gratings, these perturbations are the result of a specific modulation of the relative group-velocity of light caused by a periodic variation of the refractive index of the grating material. The diffracting gratings described herein are phase gratings, although amplitude gratings can also be used in other implementations.
(42) In
(43) It is to be noted that while the diffraction grating 28 in
(44) Referring still to
(45) Referring still to
(46) The pixel array 38 can also be characterized by a pixel pitch 62. In the present description, the term “pixel pitch” generally refers to the spacing between the individual pixels 40 and is typically defined as the center-to-center distance between adjacent pixels 40. Depending on the physical arrangement of the pixel array 38, the pixel pitch 62 along the two orthogonal pixel axes 58, 60 may or may not be the same. It is to be noted that a pixel pitch can also be defined along an arbitrary axis, for example along a diagonal axis oriented at 45° with respect to the two orthogonal pixel axes 58, 60. It is also to be noted that, in the present techniques, a relevant pixel pitch 62 is the one along the grating axis 30 of the overlying diffraction grating 28, as depicted in
(47) In the present description, the term “pixel data” refers to the image information captured by each individual pixel and can include intensity data indicative of the total amount of optical energy absorbed by each individual pixel over an integration period. Combining the pixel data from all the pixels 40 yields “raw” light field image data about the scene 22. In the present techniques, because the optical wavefront 26 incident from the scene 22 is diffracted and spatially and spectrally filtered prior to detection, the light field image data provides information about not only the intensity of the incident wavefront 26, but also other light field parameters such as its angle of incidence, phase and spectral content. More particularly, it will be understood that the present techniques can allow recovery or extraction of depth or other light field information from the intensity-based diffraction pattern captured by the pixel array 38, as described further below.
(48) Referring still to
(49) In
(50) Referring now to
(51) Referring still to
(52) The diffraction grating 28 and the pixel array 38 are disposed relative to each other such that the light-receiving surface 68 of the pixel array 38 is positioned in the near-field diffraction region, or simply the near field, of the diffraction grating 28, as described in co-assigned international PCT patent application No. PCT/CA2017/050686. In the near-field diffraction regime, the Fresnel diffraction theory can be used to calculate the diffraction pattern of waves passing through a diffraction grating. Unlike the far-field Fraunhofer diffraction theory, Fresnel diffraction accounts for the wavefront curvature, which allows calculation of the relative phase of interfering waves. To detect the diffracted wavefront 36 in the near field, the present techniques can involve maintaining a sufficiently small separation distance 72 between the top surface 48 of the diffraction grating 28, where refractive index modulation pattern 32 is formed and diffraction occurs, and the light-receiving surface 68 of the underlying pixel array 38, where the diffracted wavefront 36 is detected. In some implementations, this can involve selecting the separation distance 72 to be less than about ten times a center wavelength of the optical wavefront 26.
(53) In the near-field diffraction regime, the intensity profile 70 of the diffracted wavefront 36 produced by a periodic diffraction grating 28 generally has a spatial period 74 that substantially matches the grating period 34 of the diffraction grating 28 as well as a shape that substantially matches the refractive index modulation pattern 32 of the diffraction grating 28. For example, in
(54) In the embodiment illustrated in
(55) Upon being optically coupled to an underlying pixel array 38, the diffraction grating 28 convolves light phase information with a standard 2D image, so that the intensity profile 70 of the diffraction pattern of the detected diffracted wavefront 36 can generally be written as a modulated function I˜I.sub.mod(depth info)×I.sub.base (2D image) including a modulating component I.sub.mod and a base component I.sub.base. The base component I.sub.base represents the non-phase-dependent optical wavefront that would be detected by the pixel array 38 if there were no diffraction grating 28 in front of it. In other words, detecting the base component I.sub.base alone would allow a conventional 2D image of the scene 22 to be obtained. Meanwhile, the modulating component I.sub.mod, which is generally small compared to the base component I.sub.base (e.g., ratio of I.sub.mod to I.sub.base ranging from about 0.1 to about 0.3), is a direct result of the phase of the incident optical wavefront 26, so that any edge or slight difference in incidence angle will manifest itself as a periodic electrical response spatially sampled across the pixel array 38. The sensitivity to incidence angle, and therefore the angular resolution, depends on the specific design of the diffraction grating 28.
(56) It will be understood that the intensity profile 70 of the diffracted wavefront 36 that is detected by the pixel array 38 after spatio-spectral filtering by the color filter array 42 is a combination or superposition of the portions of the diffracted wavefront 36 filtered by the red filters, the portions of the diffracted wavefront 36 filtered by the green filters, and the portions of the diffracted wavefront 36 filtered by the blue filters. As such, using a standard RGB Bayer pattern as an example, the modulating component I.sub.mod and the base component I.sub.base of the intensity profile I can be split into their respective color components as follows:
I.sub.R˜I.sub.mod,R(depth info)×I.sub.base,R(2D image), (3)
I.sub.G˜I.sub.mod,G(depth info)×I.sub.base,G(2D image), (4)
I.sub.B˜I.sub.mod,B(depth info)×I.sub.base,B(2D image). (5)
(57) In
(58) Referring still to
(59) Depending on the application, the ratio R of the grating period 34 of the diffraction grating 28 to the pixel pitch 62 of the pixel array 38 along the grating axis 30 can take several values. In some implementations, the ratio R can be equal to or greater than two (i.e., R≥2); or equal to a positive integer greater than one (i.e., R=(n+1), where n={1, 2, . . . }); or equal to an integer power of two (i.e., R=2n, where n={1, 2, . . . }); or the like. In some implementations, it may be beneficial or required that the grating period 34 be not only larger than, but also not too close to the pixel pitch 62 along the grating axis 30. For example, in some implementations, it may be advantageous that the grating period 34 be at least about twice the underlying pixel bank pitch 62 to allow for each pair of adjacent pixel banks to sufficiently subsample the resultant modulated diffracted wavefront 36, whose spatial modulation rate is dictated by the properties of the diffraction grating 28, near or at Nyquist rate. This Nyquist, or nearly Nyquist, subsampling can allow for the direct removal of the modulating component I.sub.mod from the measured signal I by standard signal processing techniques. Once removed, the modulating signal I.sub.mod may be manipulated independently of the base component I.sub.base.
(60) For example, in
I.sub.base=½[I(bank.sub.n)+I(bank.sub.n+1)], (6)
I.sub.mod=½[I(bank.sub.n)−I(bank.sub.n+1)]. (7)
(61) It is to be noted that I(bank.sub.n)±I(bank.sub.n+1) in Equations (6) and (7) are generally obtained by summing the intensities measured by the pixels of all the rows in the associated pixel bank.
(62) Referring still to
(63) In a standard RGB Bayer pattern, because the red and blue filters are always located in adjacent pixel banks, the signals I.sub.R and I.sub.B, which are associated with the sparsely sampled red and blue components, will be in antiphase relative to each other. Meanwhile, because green filters are present in all pixel banks, the signal I.sub.G, which is associated with the densely sampled green components, will contain both in-phase and out-of-phase contributions.
(64) In
(65) In the examples described so far, the diffraction grating assembly included a single diffracting grating. However, referring to
(66) In
(67) In some applications, the light field capture device can include wavefront conditioning optics in front of the diffraction grating. The wavefront conditioning optics can be configured to collect, direct, transmit, reflect, refract, disperse, diffract, collimate, focus or otherwise act on the optical wavefront incident from the scene prior to it reaching the diffraction grating assembly. The wavefront conditioning optics can include lenses, mirrors, filters, optical fibers, and any other suitable reflective, refractive and/or diffractive optical components, and the like. In some implementations, the wavefront conditioning optics can include focusing optics positioned and configured to modify the incident wavefront in such a manner that it may be sampled by the light field capture device.
(68) Referring now to
(69) For exemplary purposes, it is assumed in
(70) Referring to
(71) The image data captured by the light field capture device 20 of
(72) Referring still to
(73) The ability of this optical element to focus incident light 26 is related to the light's origin distance from the focal plane of the optical element and the focusing power of the optical element. When incident light emanates from focal or hyperfocal distances, or optical infinity, its wavefront will, in general, effectively be tightly focused onto very few pixels 40. Light that is insufficiently focused, or focused too strongly, will continue to spread across the pixel array 38, leading to a curved wavefront which can expand over multiple pixels 40.
(74) The focused light then encounters the diffraction grating assembly 24 disposed atop the imaging system (i.e., color filter array 42 and pixel array 38). In
(75) The diffracted wavefront 36 is further spectrally filtered by the underlying color filter array 40 into two or more discrete spectral components. The intensity of this spectrally filtered, diffracted wavefront is finally measured by the underlying pixels 40 and converted into electrical signals representative of the intensity of light captured by the pixels 40. The electrical signals can then be digitized into “raw” light field image data, typically arranged or formatted into an image frame of rows and columns of data pixels. This light field image data will generally be a collection of intensity, spectral, and angular information about the original wavefront, sampled at the pixel level, which can be summarized as follows: I=Angular Information (wavelength, diffraction grating assembly location)+2D (wavelength, pixel location). The present description discloses image processing techniques in which this light field image data is processed to reconstruct a light field image of the observed scene 22. Depending on the application, the present techniques can provide a full resolution 2D image, a 3D point cloud, a depth map/image and/or a linearly reconstructed light field image from the captured light field image data.
(76) Returning to
(77) As mentioned above, the raw light field image data in each color channel can be expressed as a modulated function I˜I.sub.mod(depth info)×I.sub.base(2D image) including a modulating component I.sub.mod and a base component I.sub.base. The base component I.sub.base represents the non-phase-dependent optical wavefront from which a conventional 2D image of a scene can be obtained. The modulating component I.sub.mod results from the phase-dependent perturbation to the incident wavefront created by the diffraction grating assembly 24. Therefore, in the case of a standard RGB Bayer pattern, the intensity profile of each color channel can be written as follows I.sub.N˜I.sub.mod,N(depth info)×I.sub.base,N(2D image), where N=color or spectral channel, for example red (R), green (G) and blue (B)—see Equations (3) to (5). As mentioned above, the present techniques are not limited to Bayer-type patterns, but can be applied to any appropriate mosaic color pattern.
(78) In applications where the diffraction grating assembly 24 includes multiple sets of diffraction gratings 28a, 28b (e.g., having different orientations, as in
(79) In some implementations of step 202, the color channels into which the light field image data is parsed includes a main color channel and one or more secondary or auxiliary color channels. In the present description, a main color channel is a color channel having pixels in all pixel banks, such that pixels of the main color channel in adjacent pixel banks sample different spatial parts of the intensity profile of the diffracted wavefront 36 over a full cycle. As mentioned above, a pixel bank refers herein to a group of pixels 40 of the pixel array 38 that are arranged along a line which is perpendicular to the grating axis 30a, 30b of the overlying diffraction grating 28a, 28b. In some implementations, there could be two or more main channels if there are more than one color having pixels in all pixel banks.
(80) In the device 20 of
(81) Referring still to
(82) In the example of
I.sub.base,G=½[I.sub.G(bank.sub.n)+I.sub.G(bank.sub.n+1)]. (8)
(83) Alternatively, the modulating component I.sub.mod,G may be removed by Fourier filtering.
(84) The method 200 can also include a step 206 of using the main color channel to provide or extract a coarse raw modulating component, which will be used to obtain coarse phase/depth information about the scene 22.
(85) Again, in the example of
I.sub.mod,G=½[I.sub.G(bank.sub.n)−I.sub.G(bank.sub.n+1)]. (9)
(86) The modulating component I.sub.mod,G can provide coarse angular or phase information about the incident optical wavefront, from which light field images and/or depth maps can be generated. As described above, the modulating component I.sub.mod,G represents the wavefront's phase difference pixel by pixel, so that I.sub.mod,G contains information about the phase or spread of the incident optical wavefront 26 from the scene 22. More particularly, the modulating component I.sub.mod,G is the result of the tilt/angular spread of the green component of the incident wavefront 26 (e.g., caused by focusing optics in front of the diffraction grating 24 assembly—see
(87) Referring still to
(88) In some implementations, the difference between I.sub.G(bank.sub.n) and I.sub.G(bank.sub.n+1) can give the sign of the modulation pattern imparted by the diffraction grating. That is, if I.sub.G(bank.sub.n)<I.sub.G(bank.sub.n+1), then it can be concluded that bank.sub.n undergoes destructive interference and bank.sub.n+1 undergoes constructive interference, and vice versa if I.sub.G(bank.sub.n)>I.sub.G(bank.sub.n+1). Therefore, in a scenario where I.sub.G(bank.sub.n)<I.sub.G(bank.sub.n+1), with red pixels in bank.sub.n and blue pixels in bank.sub.n+1, it can be deduced that I.sub.R(bank.sub.n)<I.sub.base,R and that I.sub.B(bank.sub.n+1)>I.sub.base,B, such that I.sub.base,R=I.sub.R(bank.sub.n)+I.sub.mod,R and I.sub.base,B=I.sub.B(bank.sub.n+1)−I.sub.mod,B. The knowledge of the sign of [I.sub.G(bank.sub.n)−I.sub.G(bank.sub.n+1)] can therefore allow to determine whether to apply a constructive or destructive phase offset to I.sub.R(bank.sub.n) and I.sub.B(bank.sub.n+1) to obtain (I.sub.base,R, I.sub.mod,R) and (I.sub.base,B, and I.sub.mod,B).
(89) To obtain the amplitude of the phase offset for red light, that is, I.sub.mod,R one can use a relationship between the wavefront spread of green light and the wavefront spread of red light in the image capture device (e.g., caused by the diffraction grating and focusing optics) to derive I.sub.mod,R from I.sub.mod,G. Similarly, to obtain the amplitude of the phase offset for blue light, that is, I.sub.mod,B, one can use a relationship between the wavefront spread of green light and the wavefront spread of blue light in the image capture device (e.g., caused by the diffraction grating and focusing optics) to derive I.sub.mod,B from I.sub.mod,G. In some implementations, fine angular information about the wavefront spread can be obtained from the phase-shifted red and blue secondary modulating components I.sub.mod,R and I.sub.mod,B, as described below. Finally, the red and blue base components can be obtained using I.sub.base,R=I.sub.R(bank.sub.n)+I.sub.mod,R and I.sub.base,B=I.sub.B(bank.sub.n+1)−I.sub.mod.
(90) Referring still to
(91) Referring still to
(92) In some implementations, the color radial transfer functions (CRTFs) relate the diffraction grating pattern of a diffraction grating as a function of angle to the distance of an object is away from the camera. In other words, the CRTFs may allow one to quantify the modulation imparted for two phase components of a diffraction grating, phase0 and phase180, without apriori knowledge of the scene.
(93) For the purpose of explanation only and without being limitative to the scope of the present description, let there be considered a situation in which a camera is capturing images of a white light point-source in predetermined intervals at increasing distances away from the camera, as shown in
Pixel.sub.Red∝Intensity(λ)×Transmission(λ) (10.1)
or Pixel.sub.Red=2D(Red) (10.2)
(94) The TDM's modulated phase component will also vary from pixel bank to pixel bank:
Pixel.sub.n∝Intensity(λ)×Transmission(λ)×Modulation.sub.Phase0(λ,Z) (11.1)
Pixel.sub.n+1∝Intensity(λ)×Transmission(λ)×Modulation.sub.Pphase180(λ,Z) (11.2)
(95) In the case of a Bayer patterned sensor where the underlying color filters spatio-chromatically sample the light at a rate greater than the perturbation frequency caused by the diffraction grating (i.e. the pixel array having a pixel pitch along the grating axis that is smaller than the grating period), the three-color channels, as shown in
Pixel.sub.Red=2D(Red)×Modulation.sub.Phase0(λ,Z) (12.1)
Pixel.sub.Green1=2D(Green)×Modulation.sub.Phase180(λ,Z) (12.2)
Pixel.sub.Green2=2D(Green)×Modulation.sub.Phase0(λ,Z) (12.3)
Pixel.sub.Blue=2D(Blue)×Modulation.sub.Phase180(λ,Z) (12.4)
(96) Or, using the formalism developed above:
Pixel.sub.RedI.sub.BaseR(Z)×I.sub.ModR.sub.
Pixel.sub.Green1=I.sub.BaseG1(Z)×I.sub.ModR.sub.
Pixel.sub.Green2=I.sub.BaseG2(Z)×I.sub.ModR.sub.
Pixel.sub.Blue=I.sub.BaseB(Z)×I.sub.ModR.sub.
(97) The functional forms of equations 12.1 through 12.4 and 13.1 through 13.4 are shown in
(98) In the case of a Bayer patterned sensor, one can remove the influence of the diffraction grating on the image, or in other words, extract depth information, by having one color channel sample both phase components:
Pixel.sub.n=Intensity(n)×phase0(Z) (14.1)
Pixel.sub.n+1=Intensity(n+1)×phase180(Z) (14.2)
(99) This brings the definition of a simplified main (primary) color channel radial transfer function whose functional form is shown in
Δa(Z)=Pixel.sub.n−Pixel.sub.n+1=Δphase(Z) (15)
(100) The chromatic radial transfer function uses the difference between the two main color channel modulated components to remove intensity from the image, allowing for a normalized value to relate the other two color channels who do not have both phase components and whose intensities are therefore variable with both the 2D image as well as the diffraction grating modulation, or depth information.
(101) A similar experiment to that imagined in
(102) Referring to
(103) This concept can also be extended to the lens system and its achromatic aberrations and chromatically dependent focal differences through the same processing method.
(104) In some embodiments, the raw wavefront spread information contained in the modulating component I.sub.mod,G can be compared to an appropriate green radial transfer function (GRTF) to yield a coarse depth map of the scene 22. More particularly, I.sub.mod,G can provide relative phase information and comparison with a calibrated, sensor-specific GRTF can allow one to obtain absolute phase information from the relative phase information provided by I.sub.mod,G. In other words, this means that the relative phase information provided by I.sub.mod,G can be mapped to a wavefront spread using the GRTF, this wavefront spread itself corresponding to an object position relative to the focal plane.
(105) Optionally, the coarse depth map obtained from I.sub.mod,G and the GRTF can be corrected or enhanced using data from the secondary color channels (e.g., I.sub.mod,R, I.sub.mod,B and their associated red and blue radial transfer functions RRTFs and BRTFs). This means that the direct comparison of the modulating components I.sub.mod,G, I.sub.mod,R and I.sub.mod,B for the three color channels is done through measured CRTFs for each color channel in order to establish object distance.
(106) As mentioned above, each one of the CRTFs provides an empirical fit of the directly measured wavefront spread as a function of the focal plane of the image capture device 20. Collections of these sparse data-sets of calibration images captured at varying distances from the focal plane are fit as a function of their distance from the camera and provide a direct empirical measure of the radially extended spread of the wavefront as a function of distance from the focal plane as measured by the modulating components I.sub.mod,G (main), and I.sub.mod,R, and I.sub.mod,B (secondary) of the image and sampled by the diffraction grating assembly 24. These functions provide a method for directly mapping the measured relative phase/wavefront spread information given from I.sub.mod,G, I.sub.mod,R, and I.sub.mod,B to an absolute object distance from the image sensor, and therefore, depth (Z):
Object(Z)˜RGB2D Image×CRTF(r,ϕ,θ,n,λ), (10)
where z is the distance from the detection plane of image capture device of a given object in the scene 22, RGB 2D Image is the raw 2D image given by I.sub.base,G, I.sub.base,R, and I.sub.base,B, and the CRTF is a fitted 5D function of the polar coordinates r, ϕ, θ from the focal plane of the image, the pixel number n (and therefore spatial location on the pixel array 38), and the incident wavelength λ. The CRTFs can be sampled in a single image for all spectral elements as sampled by the diffractive elements and imaging devices. That is, all CRTFs could be captured at once using white light and be subsampled directed by a color filter array. These functions may be tailored through the design of the diffractive grating assembly 24. The CRTFs can provide a macroscopic description of the micro sampling of the diffraction pattern resulting from the arrangement of the diffraction gratings 28 of the diffraction grating assembly 24. It is to be noted that in absolute depth implementations, one could use a general RTF or use color-specific CRTFs to increase depth accuracy.
(107) It will be readily understood that while the implementations described above apply to circumstances where the pixel sampling frequency is greater than the period of the grating pattern, the processing method described herein may in some instance be useful in association with similar imaging devices for which the sampling frequency is equal to or less than the grating period (or in words, where the pixel array has a pixel pitch along the grating axis that is the same or greater than the grating period). In such a cases, the steps of reconstructing the 2D image may be omitted, as there is no chromatically-dependent blur pattern created in the 2D image by the action of the grating—both phase components are already included in each grating. In such embodiments the micro-chromatic dependence of the signal and associated is lost, but the macro-chromatic dependence may still be useful to provide depth information. In other words, the provision of different color channels combined with associated color radial transfer functions can provide depth information through comparisons of the different color channels to obtain their relative difference in focal position.
(108) Of course, numerous modifications could be made to the embodiments described above without departing from the scope of the present description.