References

2015. https://asknature.org/strategy/pupil-enables-clear-vision-in-extreme-light-conditions/.

Abbas, Syed Ammar, and Andrew Zisserman. 2019. “A Geometric Approach to Obtain a Bird’s Eye View from an Image.” In 2019 IEEE/CVF International Conference on Computer Vision Workshop (ICCVW), 4095–4104.

Abbott, Edwin. 2009. Flatland. Broadview Press.

Acdx, user. 2009. “CIE 1931 XYZ Color Matching Functions.”

Adelson, E. H. 1992.

———. 1995.

———. 2000. “Lightness Perception and Lightness Illusions.” In The New Cognitive Neurosciences, edited by M. Gazzaniga, 339–51. Cambridge, MA: MIT Press.

———. 2001. “On Seeing Stuff: The Perception of Materials by Humans and Machines.” Proceedings of SPIE 4299 (June). https://doi.org/10.1117/12.429489.

Adelson, E. H., and J. R. Bergen. 1985. “Spatiotemporal Energy Models for the Perception of Motion.” Journal of the Optical Society of America A 2 (2): 284–99.

Adelson, E. H., and E. P. Simoncelli. 1987. “QMF Pyramids: A New Class of Orthogonal Pyramid Transform.” In Optical Society of America, Annual Meeting. Vol. A4–13.

Adelson, Edward H. 1995. “Checkershadow Illusion.”

Adelson, Edward H., and James R. Bergen. 1991. “The Plenoptic Function and the Elements of Early Vision.” In Computational Models of Visual Processing., edited by Michael S. Landy and Anthony J. Movshon, 3–20. Cambridge, MA: MIT Press.

Alayrac, Jean-Baptiste, Jeff Donahue, Pauline Luc, Antoine Miech, Iain Barr, Yana Hasson, Karel Lenc, et al. 2022. “Flamingo: A Visual Language Model for Few-Shot Learning.” In Nips, 35:23716–36.

Alda, Alan. 2014. “Alan Alda on Improvisation for Communication of Science.” https://www.youtube.com/watch?v=j4XgjkXDxss.

Amarasinghe, Saman, and Deanna Montgomery. 2022. “Faculty Job Talks: Tips from the Faculty.” https://www.eecs.mit.edu/career-opportunities-at-eecs/faculty-job-talks-tips-from-the-faculty/.

Andrychowicz, Marcin, Misha Denil, Sergio Gomez, Matthew W Hoffman, David Pfau, Tom Schaul, Brendan Shillingford, and Nando De Freitas. 2016. “Learning to Learn by Gradient Descent by Gradient Descent.” In Nips. Vol. 29.

Antol, Stanislaw, Aishwarya Agrawal, Jiasen Lu, Margaret Mitchell, Dhruv Batra, C Lawrence Zitnick, and Devi Parikh. 2015. “VQA: Visual Question Answering.” In Iccv, 2425–33.

Arbelaez, Pablo, Michael Maire, Charless Fowlkes, and Jitendra Malik. 2010. “Contour Detection and Hierarchical Image Segmentation.” Pami 33 (5): 898–916.

Ariely, Dan. 2001. “Seeing Sets: Representation by Statistical Properties.” Psychological Science 12 (2): 157–62.

Aristotle. 350 BC. On Sense and the Sensible. http://classics.mit.edu/Aristotle/sense.html.

Arnold, S. E. J., S. Faruq, V. Savolainen, P. W. McOwan, and L. Chittka. 2011. “FReD: The Floral Reflectance Database– a Web Portal for Analysis of Flower Colour.” PLoS ONE 5 (12): e14287.

Atick, Joseph J., and A. Norman Redlich. 1990. “Towards a Theory of Early Visual Processing.” Neural Computation 2 (3): 308–20.

———. 1992. “What Does the Retina Know about Natural Scenes?” Neural Computation 4 (2): 196–210.

Avni, Amir. 2016. http://www.whatimade.today/our-frst-reddit-bot-coloring-b-2/.

Azizi, Shekoofeh, Simon Kornblith, Chitwan Saharia, Mohammad Norouzi, and David J Fleet. 2023. “Synthetic Data from Diffusion Models Improves Imagenet Classification.” https://arxiv.org/abs/2304.08466.

Ba, Jimmy Lei, Jamie Ryan Kiros, and Geoffrey E. Hinton. 2016. “Layer Normalization.” https://arxiv.org/abs/1607.06450.

Badrinarayanan, Vijay, Ankur Handa, and Roberto Cipolla. 2015. “SegNet: A Deep Convolutional Encoder-Decoder Architecture for Robust Semantic Pixel-Wise Labelling.” Pami 39: 2481–95.

Bahng, Hyojin, Ali Jahanian, Swami Sankaranarayanan, and Phillip Isola. 2022. “Exploring Visual Prompts for Adapting Large-Scale Models.” https://arxiv.org/abs/2203.17274.

Baker, Simon, Stefan Roth, Daniel Scharstein, Michael J. Black, J. P. Lewis, and Richard Szeliski. 2007. “A Database and Evaluation Methodology for Optical Flow.” In Proceedings of the IEEE/CVF International Conference on Computer Vision, 1–8.

Balakrishnan, G., Y. Xiong, W. Xia, and P. Perona. 2020. “Towards Causal Benchmarking of Bias in Face Analysis Algorithms.” In European Conference on Computer Vision.

Ballard, Dana H. 1987. “Modular Learning in Neural Networks.” In Aaai, 647:279–84.

Bansal, Arpit, Eitan Borgnia, Hong-Min Chu, Jie S Li, Hamid Kazemi, Furong Huang, Micah Goldblum, Jonas Geiping, and Tom Goldstein. 2022. “Cold Diffusion: Inverting Arbitrary Image Transforms Without Noise.” https://arxiv.org/abs/2208.09392.

Bar, Amir, Yossi Gandelsman, Trevor Darrell, Amir Globerson, and Alexei Efros. 2022. “Visual Prompting via Image Inpainting.” In Nips, 35:25005–17.

Barber, G. 2019. “The Viral App That Labels You Isn’t Quite What You Think.” Wired.

Barnes, C., E. Shechtman, A. Finkelstein, and D. B. Goldman. 2009. “PatchMatch: A Randomized Correspondence Algorithm for Structural Image Editing.” In ACM SIGGRAPH: Proceedings of the Annual Conference on Computer Graphics and Interactive Techniques.

Barocas, Solon, Moritz Hardt, and Arvind Narayanan. 2019. Fairness and Machine Learning. fairmlbook.org.

Barron, Jonathan T. 2015. “Convolutional Color Constancy.” In Proceedings of the IEEE/CVF International Conference on Computer Vision.

Barron, Jonathan T., and Jitendra Malik. 2015. “Shape, Illumination, and Reflectance from Shading.” Pami 37: 1670–87.

Barrow, H. G., and J. M. Tenenbaum. 1978. “Recovering Intrinsic Scene Characteristics from Images.” In Computer Vision Systems, edited by A. R. Hanson and E. M. Riseman, 3–26. New York: Academic Press.

Baudes, A., B. Coll, and J.-M. Morel. 2011. “Non-Local Means Denoising.” In Image Processing on Line. Vol. 1.

Bay, H., T. Tuytelaars, and L. Van Gool. 2006. “SURF: Speeded up Robust Features.” In Eccv, 404–17.

Belkin, Mikhail, Daniel Hsu, Siyuan Ma, and Soumik Mandal. 2018. “Reconciling Modern Machine Learning and the Bias-Variance Trade-Off.” https://arxiv.org/abs/1812.11118.

Bengio, Yoshua, Aaron Courville, and Pascal Vincent. 2013. “Representation Learning: A Review and New Perspectives.” Pami 35 (8): 1798–828.

Benjamin, Ruha. 2019. Race After Technology. Polity.

Bennett, Cynthia L., Cole Gleason, Morgan Klaus Scheuerman, Jeffrey P. Bigham, Anhong Guo, and Alexandra To. 2021. “’It’s Complicated’: Negotiating Accessibility and (Mis)representation in Image Descriptions of Race, Gender, and Disability.” In CHI 2021.

Bergen, J. R., and E. H. Adelson. 1988. “Visual Texture Segmentation and Early Vision.” Nature 333: 363–64.

Biederman, I. 1987. “Recognition by Components - a Theory of Human Image Understanding.” Psychological Review 94 (2).

Biederman, Irving. 1976. “On Processing Information from a Glance at a Scene: Some Implications for a Syntax and Semantics of Visual Processing.” In Proceedings of the ACM/SIGGRAPH Workshop on User-Oriented Design of Interactive Graphics Systems, 75–88.

Binford, Thomas O. 1971. “Visual Perception by Computer.” In Proceedings of the IEEE Conference on Systems and Control (Miami, FL).

Birhane, A., and V. U. Prabhu. 2021. “Large Image Datasets: A Pyrrhic Win for Computer Vision?” In IEEE/CVF Winter Conference on Applications of Computer Vision.

Bishop, C. M. 2006. Pattern Recognition and Machine Learning. Springer-Verlag.

Blake, A., P. Kohli, and C. Rother. 2011. Markov Random Fields for Vision and Image Processing. Cambridge, MA: MIT Press.

Bleasdale, Cecilia. 2015. https://en.wikipedia.org/wiki/The_dress.

Bouman, Katherine L., Vickie Ye, Adam B. Yedidia, Fredo Durand, Gregory W. Wornell, Antonio Torralba, and William T. Freeman. 2018. “Turning Corners into Cameras: Principles and Methods.” In Iccv.

Bourlard, Hervé, and Yves Kamp. 1988. “Auto-Association by Multilayer Perceptrons and Singular Value Decomposition.” Biological Cybernetics 59 (4): 291–94.

Boyer, Carl B. 1946. “Aristotelian References to the Law of Reflection.” Isis 36 (2): 92–95.

Brainard, D. H., and W. T. Freeman. 1997. “Bayesian Color Constancy.” Journal of the Optical Society of America A 14 (7): 1393–1411.

Brainard, David H., and Anya C. Hurlbert. 2015. “Colour Vision: Understanding #TheDress.” Current Biology 25: R551–54.

Brock, Andrew, Jeff Donahue, and Karen Simonyan. 2019. “Large Scale GAN Training for High Fidelity Natural Image Synthesis.” International Conference on Learning Representations.

Brooks, Tim, Aleksander Holynski, and Alexei A Efros. 2023. “Instructpix2pix: Learning to Follow Image Editing Instructions.” In Cvpr.

Brown, Tom, Benjamin Mann, Nick Ryder, Melanie Subbiah, Jared D Kaplan, Prafulla Dhariwal, Arvind Neelakantan, et al. 2020. “Language Models Are Few-Shot Learners.” In Nips, 33:1877–1901.

Buades, A., B. Coll, and J.-M. Morel. 2005. “A Non-Local Algorithm for Image Denoising.” In Cvpr, 2:60–65 vol. 2.

Buolamwini, J., and T. Gebru. 2018. “Intersectional Accuracy Disparities in Commercial Gender Classification.” In Proceedings of Machine Learning Research Conference on Fairness, Accountability, and Transparency., 81:1–15.

Burt, P. J., and E. H. Adelson. 1983. “The Laplacian Pyramid as a Compact Image Code.” IEEE Transactions on Communications 31 (4): 532–40.

Burton, Harry Edwin. 1945. “The Optics of Euclid.” J. Opt. Soc. Am. 35 (5): 357–72.

Butler, D. J., J. Wulff, G. B. Stanley, and M. J. Black. 2012. “A Naturalistic Open Source Movie for Optical Flow Evaluation.” In Eccv, edited by A. Fitzgibbon et al., 611–25. Part IV, LNCS 7577. Springer-Verlag.

Canny, J. F. 1986. “A Computational Approach to Edge Detection.” IEEE Transactions on Pattern Analysis and Machine Intelligence 8 (6): 679–98.

Caron, Mathilde, Hugo Touvron, Ishan Misra, Hervé Jégou, Julien Mairal, Piotr Bojanowski, and Armand Joulin. 2021. “Emerging Properties in Self-Supervised Vision Transformers.” In Iccv, 9650–60.

Cavanagh, P. 1996. “Vision Is Getting Easier Every Day.” Perception 24: 1227–32.

Cavazos, Jacqueline G., P. Jonathon Phillips, Carlos D. Castillo, and Alice J. O’Toole. 2021. “Accuracy Comparison Across Face Recognition Algorithms: Where Are We on Measuring Race Bias?” IEEE Transactions on Biometrics, Behavior, and Identity Science 3: 101–11.

Chai, Lucy, Jun-Yan Zhu, Eli Shechtman, Phillip Isola, and Richard Zhang. 2021. “Ensembling with Deep Generative Views.” In Cvpr, 14997–5007.

Chang, Chin-Kai, Jiaping Zhao, and Laurent Itti. 2018. “DeepVP: Deep Learning for Vanishing Point Detection on 1 Million Street View Images.” In 2018 IEEE International Conference on Robotics and Automation (ICRA), 1–8.

Chang, Jia-Ren, and Yong-Sheng Chen. 2018. “Pyramid Stereo Matching Network.” In Proceedings of the IEEE/CVF Computer Vision and Pattern Recognition.

Chechik, Gal, Varun Sharma, Uri Shalit, and Samy Bengio. 2010. “Large Scale Online Learning of Image Similarity Through Ranking.” Journal of Machine Learning Research 11 (3).

Chehikian, A., and James Crowley. 1991. “Fast Computation of Optimal Semi-Octave Pyramids.” Scia, 18–27.

Chen, Mark, Alec Radford, Rewon Child, Jeffrey Wu, Heewoo Jun, David Luan, and Ilya Sutskever. 2020. “Generative Pretraining from Pixels.” In Icml, 1691–1703.

Chen, Ting, Simon Kornblith, Mohammad Norouzi, and Geoffrey Hinton. 2020. “A Simple Framework for Contrastive Learning of Visual Representations.” In Icml, 1597–607.

CIE. 1931. “CIE 1931 XYZ Color Matching Functions.” https://commons.wikimedia.org/wiki/File:CIE1931_RGBCMF.png.

Cohen, Adam L. 1982. “Anti-Pinhole Imaging.” Optica Acta: Intl. J. Of Optics 29 (1).

Cooley, James W., and John W. Tukey. 1965. “An Algorithm for the Machine Calculation of Complex Fourier Series.” Mathematics of Computation 19: 297–301.

Cortes, Corinna, and Vladimir Vapnik. 1995. “Support-Vector Networks.” Machine Learning 20: 273–97.

Coughlan, James, and Alan L. Yuille. 1999. “Manhattan World: Compass Direction from a Single Image by Bayesian Inference.” In Iccv, 941–47.

Criminisi, Antonio. 1999. “Accurate Visual Metrology from Single and Multiple Uncalibrated Images.” PhD thesis.

Csurka, G., C. Bray, C. Dance, and L. Fan. 2004. “Visual Categorization with Bags of Keypoints.” Workshop on Statistical Learning in Computer Vision, ECCV, 1–22.

Cummings, M. L. 2004. “Automation Bias in Intelligent Time Critical Decision Support Systems.” In AIAA Third Intelligent Systems Conference.

Curcio, Christine A., Kenneth R. Sloan, Robert E. Kalina, and Anita E. Hendrickson. 1990. “Human Photoreceptor Topography.” Journal of Comparative Neurology 292 (4): 497–523.

Curcio, Christine A., Kenneth R. Sloan, Orin S. Packer, Anita Hendrickson, and Robert E. Kalina. 1987. “Distribution of Cones in Human and Monkey Retina: Individual Variability and Radial Asymmetry.” Science 236 4801: 579–82.

Curless, Brian, and Marc Levoy. 1996. “A Volumetric Method for Building Complex Models from Range Images.” In Proceedings of the 23rd Annual Conference on Computer Graphics and Interactive Techniques, 303–12.

Cybenko, G. 1989. “Approximation by Superpositions of a Sigmoidal Function.” Mathematics of Control, Signals and Systems 2 (4): 303–14.

d’Alessandro, Brian, Cathy O’Neil, and Tom LaGatta. 2017. “Conscientious Classification: A Data Scientist’s Guide to Discrimination-Aware Classification.”

Dalal, N., and B. Triggs. 2005. “Histograms of Oriented Gradients for Human Detection.” In Proceedings of the IEEE/CVF Computer Vision and Pattern Recognition.

Dalmotas, Dainius J., Regina M. Hurley, and Alan German. 1985. “Air Bag Deployments Involving Restrained Occupants.” SAE Transactions 104 (6): 1507–12.

Darrell, Trevor, and Eero Simoncelli. 1993. “On the Use of ’Nulling’ Filters to Separate Transparent Motions.” In Cvpr.

Daugman, J. G. 1989. “Entropy Reduction and Decorrelation in Visual Coding by Oriented Neural Receptive Fields.” IEEE Transactions on Biomedical Engineering 36 (1): 107–14.

De Valois, R. L., K. K. De Valois, and Oxford University Press. 1988. Spatial Vision. Oxford Psychology Series. Oxford University Press.

DeBonet, J. S., and P. Viola. 1998. “Texture Recognition Using a Non-Parametric Multi-Scale Statistical Model.” In Proceedings of the IEEE/CVF Computer Vision and Pattern Recognition.

Deglr6328. 2006. “Blue Sky Spectrum.” https://en.wikipedia.org/, File:Spectrum_of_blue_sky.png.

Denton, Emily, Ben Hutchinson, Margaret Mitchell, Timnit Gebru, and Andrew Zaldivar. 2019. “Image Counterfactual Sensitivity Analysis for Detecting Unintended Bias.” In Computer Vision and Pattern Recognition Workshop.

DeTone, D., T. Malisiewicz, and A. Rabinovich. 2018. “SuperPoint: Self-Supervised Interest Point Detection and Description.” In 2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops (CVPRW), 337–33712.

DeVries, Terrance, Ishan Misra, Changhan Wang, and Laurens van der Maaten. 2019. “Does Object Recognition Work for Everyone?” In Computer Vision and Pattern Recognition Workshop.

Doersch, Carl, Abhinav Gupta, and Alexei A Efros. 2015. “Unsupervised Visual Representation Learning by Context Prediction.” In Iccv, 1422–30.

Doersch, Carl, Saurabh Singh, Abhinav Gupta, Josef Sivic, and Alexei A. Efros. 2012. “What Makes Paris Look Like Paris?” ACM Transactions on Graphics 31 (4): 101:1–9.

Doherty, Paul. 2023. https://www.exploratorium.edu/snacks/cd-spectroscope.

Donahue, Jeff, Yangqing Jia, Oriol Vinyals, Judy Hoffman, Ning Zhang, Eric Tzeng, and Trevor Darrell. 2014. “Decaf: A Deep Convolutional Activation Feature for Generic Visual Recognition.” In Icml, 647–55. PMLR.

Donahue, Jeff, Philipp Krähenbühl, and Trevor Darrell. 2017. “Adversarial Feature Learning.” In Iclr.

Dosovitskiy, Alexey, Lucas Beyer, Alexander Kolesnikov, Dirk Weissenborn, Xiaohua Zhai, Thomas Unterthiner, Mostafa Dehghani, et al. 2021. “An Image Is Worth 16x16 Words: Transformers for Image Recognition at Scale.” Iclr.

Dosovitskiy, Alexey, Philipp Fischer, Eddy Ilg, Philip Häusser, Caner Hazirbas, Vladimir Golkov, Patrick van der Smagt, Daniel Cremers, and Thomas Brox. 2015. “FlowNet: Learning Optical Flow with Convolutional Networks.” In Iccv, 2758–66.

Duda, Richard O., and Peter E. Hart. 1972. “Use of the Hough Transformation to Detect Lines and Curves in Pictures.” Communications of the ACM 15 (1): 11–15.

Dwork, C., and A. Roth. 2014. “The Algorithmic Foundations of Differential Privacy.” Foundations and Trends in Theoretical Computer Science 9 (3–4): 211–407.

Dwork, Cynthia, Moritz Hardt, Toniann Pitassi, Omer Reingold, and Rich Zemel. 2012. “Fairness Through Awareness.” In ITCS ’12: Proceedings of the Third Innovations in Theoretical Computer Science Conference, 214–26.

Dwork, Cynthia, Nitin Kohli, and Deirdre Mulligan. 2019. “Differential Privacy in Practice: Expose Your Epsilons!” Journal of Privacy and Confidentiality 9 (2).

Efros, A. A., and W. T. Freeman. 2001. “Image Quilting for Texture Synthesis and Transfer.” In ACM SIGGRAPH: Proceedings of the Annual Conference on Computer Graphics and Interactive Techniques, 341–46.

Efros, A. A., and T. K. Leung. 1999. “Texture Synthesis by Non-Parametric Sampling.” In Proceedings of the IEEE/CVF International Conference on Computer Vision.

Elman, Jeffrey L. 1990. “Finding Structure in Time.” Cognitive Science 14 (2): 179–211.

Elsayed, Gamaleldin F, Ian Goodfellow, and Jascha Sohl-Dickstein. 2018. “Adversarial Reprogramming of Neural Networks.” https://arxiv.org/abs/1806.11146.

Everingham, M., L. Van Gool, C. K. I. Williams, J. Winn, and A. Zisserman. 2010. “The PASCAL Visual Object Classes (VOC) Challenge.” International Journal of Computer Vision 88: 303–38.

Fara, P. 2015. “Newton Shows the Light: A Commentary on Newton (1672) ‘a Letter … Containing His New Theory about Light and Colours…’.” Philosophical Transactions of the Royal Society.

Farrell, Michael, and Cliff Haynes. 2017. Straw Camera. https://strawcamera.com/.

Fashion, Amazon. 2021. “Fun World Adult Cockroach Costume.” https://www.amazon.com/Fun-World-Costumes-Cockroach-Costume/dp/B0038ZQYRC.

Faugeras, Olivier. 1993. Three-Dimensional Computer Vision: A Geometric Viewpoint. Cambridge, MA: MIT Press.

Fellbaum, Christiane, ed. 1998. WordNet: An Electronic Lexical Database. Cambridge, MA: MIT Press.

Felzenszwalb, Pedro F., Ross B. Girshick, David McAllester, and Deva Ramanan. 2010. “Object Detection with Discriminatively Trained Part-Based Models.” Pami 32 (9): 1627–45.

Fergus, Rob, Pietro Perona, and Andrew Zisserman. 2007. “Weakly Supervised Scale-Invariant Learning of Models for Visual Recognition.” International Journal of Computer Vision 71 (3): 273–303.

Fergus, R., B. Singh, A. Hertzmann, S. Roweis, and W. T. Freeman. 2006. “Removing Camera Shake from a Single Image.” ACM Transactions on Graphics 25 (3): 787--794.

Field, David J. 1987. “Relations Between the Statistics of Natural Images and the Response Properties of Cortical Cells.” Josa 4 (12): 2379–94.

Fischler, M. A., and R. C. Bolles. 1981. “Random Sample Consensus: A Paradigm for Model Fitting with Applications to Image Analysis and Automated Cartography.” Communications of the ACM 24 (6): 381–95.

Fischler, M. A., and R. A. Elschlager. 1973. “The Representation and Matching of Pictorial Structures.” IEEE Transactions on Computers C-22 (1): 67–92.

Fleet, D., and A. Jepson. 1989. “Computation of Normal Velocity from Local Phase Information.” In Proceedings of the IEEE/CVF Computer Vision and Pattern Recognition, 379–86.

Fleming, R. W., R. O. Dror, and E. H. Adelson. 2001. “Surface Reflectance Estimation Under Unknown Natural Illumination.” Journal of Vision.

Fleuret, Francois, and Donald Geman. 2001. “Coarse-to-Fine Face Detection.” Ijcv 41 (1–2): 85--107.

Fodor, Jerry A. 1975. The Language of Thought. Vol. 5. Cambridge, MA: Harvard University Press.

Forsyth, David A., and Jean Ponce. 2012. Computer Vision - a Modern Approach, Second Edition. Pitman.

Fourier, Jean Baptiste Joseph. 2009. Théorie Analytique de la Chaleur. Cambridge Library Collection. Cambridge University Press.

Freedman, David H. 2010. Wrong: Why Experts* Keep Failing Us–and How to Know When Not to Trust Them. New York: Little, Brown & Co.

Freeman, W. T. 1994. “The Generic Viewpoint Assumption in a Framework for Visual Perception.” Nature 368 (6471): 542–45.

Freeman, W. T., and E. H. Adelson. 1990. “Steerable Filters for Early Vision, Image Analysis, and Wavelet Decomposition.” In Proceedings of the IEEE/CVF International Conference on Computer Vision, 406–15.

———. 1991. “The Design and Use of Steerable Filters.” Pami 13 (9): 891–906.

Freeman, W. T., E. H. Adelson, and D. J. Heeger. 1991. “Motion Without Movement.” In ACM SIGGRAPH: Proceedings of the Annual Conference on Computer Graphics and Interactive Techniques, 27–30.

Freeman, W. T., E. H. Adelson, and A. P. Pentland. 1990. “Shape-from-Shading Analysis with Bumplets and Shadelets.” Investigative Ophthalmology and Visual Science (ARVO), 410.

Freeman, W. T., D. B. Anderson, P. A. Beardsley, C. N. Dodge, M. Roth, C. D. Weissman, W. S. Yerazunis, et al. 1998. “Computer Vision for Interactive Computer Graphics.” IEEE Computer Graphics and Applications 18 (3): 42–53.

Freeman, W. T., and D. H. Brainard. 1995. “Bayesian Decision Theory, the Maximum Local Mass Estimate, and Color Constancy.” In Proceedings of the IEEE/CVF International Conference on Computer Vision, 210–17.

Freeman, William T. 2020. “How to Write Good Papers.” In.

Fridovich-Keil, Sara, Alex Yu, Matthew Tancik, Qinhong Chen, Benjamin Recht, and Angjoo Kanazawa. 2022. “Plenoxels: Radiance Fields Without Neural Networks.” In Cvpr, 5501–10.

Fukushima, Kunihiko. 1980. “Neocognitron: A Self-Organizing Neural Network Model for a Mechanism of Pattern Recognition Unaffected by Shift in Position.” Biological Cybernetics 36 (4): 193–202.

Gabor, Dennis. 1946. “Theory of Communication.” Journal of the Institution of Electrical Engineers - Part I: General 94: 58–58.

Gage, Philip. 1994. “A New Algorithm for Data Compression.” The C Users Journal 12 (2): 23–38.

Gagniuc, P. A. 2017. Markov Chains: From Theory to Implementation and Experimentation. John Wiley & Sons.

Galileo, G. 2015. Sidereus Nuncius, or the Sidereal Messenger. Chicago: University of Chicago Press.

Gardenia. n.d. https://www.gardenia.net/plants/plant-family/hepatica-liverleaf.

Garvie, Clare, Alvaro Bedoya, and Jonathan Frankle. 2019. “The Perpetual Line-up.” https://www.perpetuallineup.org/.

Gebru, Timnit, and Emily Denton. 2020. “CVPR Tutorial on Fairness, Accountability, Transparency, and Ethics in Computer Vision.”

———. 2021. “CVPR Workshop: Beyond Fairness: Towards a Just, Equitable, and Accountable Computer Vision.” https://sites.google.com/view/beyond-fairness-cv/.

Geiger, A, P Lenz, C Stiller, and R Urtasun. 2013. “Vision Meets Robotics: The KITTI Dataset.” The International Journal of Robotics Research 32 (11): 1231–37.

Geman, Donald, Bruno Jedynak, Programme Robotique, and Projet Syntim. 1994. “Shape Recognition and Twenty Questions.” In Proceedings Reconnaissance Des Formes Et Intelligence Articielle, 21–37.

Gibson, James J. 1966. The Senses Considered as Perceptual Systems. Boston: Houghton Mifflin.

———. 1979. The Ecological Approach to Visual Perception. Boston: Houghton Mifflin.

Gilbert, Rob. n.d. “Quotes.” https://www.quotes.net/quote/13310.

Gilchrist, Alan. 2006. Seeing Black and White. Oxford University Press.

Gilmer, Justin, Samuel S. Schoenholz, Patrick F. Riley, Oriol Vinyals, and George E. Dahl. 2017. “Neural Message Passing for Quantum Chemistry.” In International Conference on Machine Learning, 1263–72.

Gkioxari, G., J. Johnson, and J. Malik. 2019. “Mesh r-CNN.” In Iccv, 9784–94.

Glickstein, Mitch. 2006. “Golgi and Cajal: The Neuron Doctrine and the 100th Anniversary of the 1906 Nobel Prize.” Current Biology 16 (5): R147–51.

Goodfellow, Ian, Yoshua Bengio, and Aaron Courville. 2016. Deep Learning. Cambridge, MA: MIT Press.

Goodfellow, Ian, Jean Pouget-Abadie, Mehdi Mirza, Bing Xu, David Warde-Farley, Sherjil Ozair, Aaron Courville, and Yoshua Bengio. 2014. “Generative Adversarial Nets.” In Nips. Vol. 27.

Gorkani, M. M., and R. W. Picard. 1994. “Texture Orientation for Sorting Photos ’at a Glance.’.” In Proceedings of 12th International Conference on Pattern Recognition, 1:459–464 vol.1.

Gortler, Steven J., Radek Grzeszczuk, Richard Szeliski, and Michael F. Cohen. 1996. “The Lumigraph.” In Proceedings of the 23rd Annual Conference on Computer Graphics and Interactive Techniques, 43–54.

Gou, Jianping, Baosheng Yu, Stephen J Maybank, and Dacheng Tao. 2021. “Knowledge Distillation: A Survey.” Ijcv 129: 1789–819.

Granlund, G. H. 1978. “In Search of a General Picture Processing Operator.” Computer Graphics, Image Proc. 8: 155–73.

Granlund, G., and H. Knutsson. 1995. Signal Processing for Computer Vision. New York, NY: Springer.

Granlund, Goesta H. 1978. “In Search of a General Picture Processing Operator.” Computer Graphics and Image Processing 8 (2): 155–73.

Griffin, Gregory, Alex Holub, and Pietro Perona. 2007. “Caltech-256 Object Category Dataset.”

Grother, Patrick, Mei Ngan, and Kayee Hanaoka. 2019. “Face Recognition Vendor Test (FRVT). Part 3: Demographic Effects.” NISTIR 8280.

Grünwald, Peter D. 2007. The Minimum Description Length Principle. Cambridge, MA: MIT Press.

Gupta, Tanmay, and Aniruddha Kembhavi. 2023. “Visual Programming: Compositional Visual Reasoning Without Training.” In Cvpr, 14953–62.

Ha, David, Andrew Dai, and Quoc V Le. 2016. “Hypernetworks.” Iclr.

Hadsell, Raia, Sumit Chopra, and Yann LeCun. 2006. “Dimensionality Reduction by Learning an Invariant Mapping.” In Cvpr, 2:1735–42.

Hamidi, Foad, Morgan Klaus Scheuerman, and Stacy M. Branham. 2018. “Gender Recognition or Gender Reductionism?: The Social Implications of Embedded Gender Recognition Systems.” In CHI ’18: Proceedings of the 2018 CHI Conference on Human Factors in Computing Systems.

Hammersley, J. M., and P. Clifford. 1971. “Markov Fields on Finite Graphs and Lattices.” http://www.statslab.cam.ac.uk/~grg/books/hammfest/hamm-cliff.pdf.

Hancock, Peter, Roland Baddeley, and Leslie Smith. 1970. “The Principal Components of Natural Images.” Network: Computation in Neural Systems 3.

Hardt, Moritz. 2020. “MLSS 2020, Tübingen.”

Harmon, Leon D., and Bela Julesz. 1973. “Masking in Visual Recognition: Effects of Two-Dimensional Filtered Noise.” Science 180 (4091): 1194–97.

Harris, Chris, and Mike Stephens. 1988. “A Combined Corner and Edge Detector.” In Proc. Of Fourth Alvey Vision Conference, 147–51.

Hartley, R., and A. Zisserman. 2004. Multiple View Geometry in Computer Vision. 2nd ed. Cambridge, UK: Cambridge University Press.

Hartline, H. K. 1938. “The Response of Single Optic Nerve Fibers of the Vertebrate Eye to Illumination of the Retina.” American Journal of Physiology-Legacy Content 121 (2): 400–415.

Hassenstein, V., and W. Reichardt. 1956. “System Theoretical Analysis of Time, Sequence and Sign Analysis of the Motion Perception of the Snout-Beetle Chlorophanus.” Z. Naturforsch. B 11: 513–24.

Hays, James, and Alexei A Efros. 2007. “Scene Completion Using Millions of Photographs.” Tog 26 (3): 4.

He, Kaiming, Xinlei Chen, Saining Xie, Yanghao Li, Piotr Dollár, and Ross Girshick. 2022. “Masked Autoencoders Are Scalable Vision Learners.” In Cvpr, 15979–88.

He, Kaiming, Haoqi Fan, Yuxin Wu, Saining Xie, and Ross Girshick. 2020. “Momentum Contrast for Unsupervised Visual Representation Learning.” In Cvpr, 9729–38.

He, Kaiming, Georgia Gkioxari, Piotr Dollár, and Ross Girshick. 2017. “Mask r-CNN.” In Iccv, 2961–69.

He, Kaiming, Jian Sun, and Xiaoou Tang. 2009. “Single Image Haze Removal Using Dark Channel Prior.” In Cvpr, 1956–63.

He, Kaiming, Xiangyu Zhang, Shaoqing Ren, and Jian Sun. 2016. “Deep Residual Learning for Image Recognition.” In Cvpr, 770–78.

He, Ruifei, Shuyang Sun, Xin Yu, Chuhui Xue, Wenqing Zhang, Philip Torr, Song Bai, and Xiaojuan Qi. 2022. “Is Synthetic Data from Generative Models Ready for Image Recognition?” https://arxiv.org/abs/2210.07574.

Hebb, Donald Olding. 2005. The Organization of Behavior: A Neuropsychological Theory. Psychology Press.

Hecht, Eugene. 2016. Optics. 5th ed. Hoboken, NJ: Pearson.

Heeger, D. 1995.

Heeger, D. J., and E. P. Simoncelli. 1992. “Model of Visual Motion Sensing.” In Spatial Vision in Humans and Robots, edited by L. Harris and M. Jenkin. Cambridge Univ. Press.

Heeger, David J., and James R. Bergen. 1995. “Pyramid-Based Texture Analysis/Synthesis.” In Computer Graphics Proceedings, 229–38.

Helmholtz, H. Von. 1962. Treatise on Physiological Optics. Vol. III. New York: Dover.

Helmholtz, Hermann von. 1925. Helmholtz’s Treatise on Physiological Optics. Optical Society of America.

Hinton, Geoffrey E. 2002. “Training Products of Experts by Minimizing Contrastive Divergence.” Neural Computation 14 (8): 1771–1800.

Hinton, Geoffrey, Oriol Vinyals, and Jeff Dean. 2015. “Distilling the Knowledge in a Neural Network.” https://arxiv.org/abs/1503.02531.

Hirschmüller, H. 2007. “Stereo Processing by Semi-Global Matching and Mutual Information.” 30 (2): 328–41.

Ho, Jonathan, Ajay Jain, and Pieter Abbeel. 2020. “Denoising Diffusion Probabilistic Models.” In Nips, 33:6840–51.

Hochreiter, Sepp, and Jürgen Schmidhuber. 1997. “Long Short-Term Memory.” Neural Computation 9 (8): 1735–80.

Hofer, H., J. Carroll, J. Neitz, M. Neitz, and D. R. Williams. 2005. “Organization of the Human Trichromatic Cone Mosaic.” The Journal of Neuroscience 25 (42): 9669–79.

Hoffman, Judy, Eric Tzeng, Taesung Park, Jun-Yan Zhu, Phillip Isola, Kate Saenko, Alexei Efros, and Trevor Darrell. 2018. “Cycada: Cycle-Consistent Adversarial Domain Adaptation.” In Icml, 1989–98. PMLR.

Hoiem, Derek, Alexei A. Efros, and Martial Hebert. 2005. “Automatic Photo Pop-up.” Tog 24 (3): 577–84.

———. 2008. “Putting Objects in Perspective.” Ijcv 80 (1): 3–15.

Hoogterp, W. 2014. Your Perfect Presentation. McGraw-Hill Education eBooks.

Hopfield, John J. 1982. “Neural Networks and Physical Systems with Emergent Collective Computational Abilities.” Proceedings of the National Academy of Sciences 79 (8): 2554–58.

Horn, B. K. P. 1986. Robot Vision. Cambridge, MA: MIT Press.

Horn, B. K. P., and M. J. Brooks, eds. 1989. Shape from Shading. Cambridge, MA: MIT Press.

Horn, B. K. P., and B. G. Schunck. 1981. “Determining Optical Flow.” Artificial Intelligence 17: 185–203.

Horn, Berthold K. P. 1977. “Understanding Image Intensities.” Artificial Intelligence 8 (2): 201–31.

Hosni, Asmaa, Christoph Rhemann, Michael Bleyer, Carsten Rother, and Margrit Gelautz. 2013. “Fast Cost-Volume Filtering for Visual Correspondence and Beyond.” IEEE Transactions on Pattern Analysis and Machine Intelligence 35 (2): 504–11.

Hospital, San Jose Animal. 2021. http://www.sanjoseanimalhospital.com/puppy-and-kitten-packages.

Hough, Paul V. C. 1959. “Machine Analysis of Bubble Chamber Pictures.” In International Conference on High Energy Accelerators and Instrumentation, CERN, 1959, 554–56.

Houthooft, Rein, Richard Y. Chen, Phillip Isola, Bradly C. Stadie, Filip Wolski, Jonathan Ho, and Pieter Abbeel. 2018. “Evolved Policy Gradients.” In Nips.

Hu, Edward J, Yelong Shen, Phillip Wallis, Zeyuan Allen-Zhu, Yuanzhi Li, Shean Wang, Lu Wang, and Weizhu Chen. 2022. “LoRa: Low-Rank Adaptation of Large Language Models.” In Iclr.

Huang, Yanping, and Rajesh P. N. Rao. 2011. “Predictive Coding.” Wiley Interdisciplinary Reviews: Cognitive Science 2 (5): 580–93.

Hubel, D. H., and T. N. Wiesel. 1959. “Receptive Fields of Single Neurones in the Cat’s Striate Cortex.” The Journal of Physiology 148 (3): 574–91. https://doi.org/10.1113/jphysiol.1959.sp006308.

———. 1962. “Receptive Fields, Binocular Interaction and Functional Architecture in the Cat’s Visual Cortex.” J. Physiol. 160: 106–54.

Hutchinson, Ben, and Margaret Mitchell. 2019. “50 Years of Test (Un)fairness: Lessons for Machine Learning.” In FAT* ’19: Proceedings of the Conference on Fairness, Accountability, and Transparency, 49–58.

Huxley, Aldus. 1932. Brave New World. Chatto; Windus.

Ioffe, Sergey, and Christian Szegedy. 2015. “Batch Normalization: Accelerating Deep Network Training by Reducing Internal Covariate Shift.” In Icml, 448–56.

Isola, Phillip, Joseph J. Lim, and Edward H. Adelson. 2015. “Discovering States and Transformations in Image Collections.” In Cvpr.

Isola, Phillip, Jun-Yan Zhu, Tinghui Zhou, and Alexei A. Efros. 2017. “Image-to-Image Translation with Conditional Adversarial Networks.” In Cvpr.

Jaderberg, Max, Karen Simonyan, Andrew Zisserman, and koray kavukcuoglu. 2015. “Spatial Transformer Networks.” In Nips, 2017–25.

Jahanian, Ali, Xavier Puig, Yonglong Tian, and Phillip Isola. 2022. “Generative Models as a Data Source for Multiview Representation Learning.” In Iclr.

Jefferys, William H., and James O. Berger. 1992. “Ockham’s Razor and Bayesian Analysis.” American Scientist 80 (1): 64–72.

Jia, Menglin, Luming Tang, Bor-Chun Chen, Claire Cardie, Serge Belongie, Bharath Hariharan, and Ser-Nam Lim. 2022. “Visual Prompt Tuning.” In Eccv, 709–27.

Johansson, Gunnar. 1973. “Visual Perception of Biological Motion and a Model for Its Analysis.” Perception & Psychophysics 14 (2): 201–11.

Jordan, M. I., ed. 1998. Learning in Graphical Models. Cambridge MA: MIT Press.

Julesz, Bela. 1981. “Textons, the Elements of Texture Perception, and Their Interactions.” Nature 290 (5802): 91–97.

Julez, B. 1971. Foundations of Cyclopean Perception. University of Chicago Press.

Kac, Mark. 1966. “Can One Hear the Shape of a Drum?” American Mathematical Monthly.

Kahn, Jeremy. 2021. “HireVue Drops Facial Monitoring Amid AI Algorithm Audit.” Fortune.

Kahneman, Daniel. 2011. Thinking, Fast and Slow. Macmillan.

Kajiya, James T. 1986. “The Rendering Equation.” In Siggraph, 143--150.

Kajiya, Jim. 1993. “How to Get Your SIGGRAPH Paper Rejected.” In SIGGRAPH Papers Chair.

Kalderon, Mark Eli. 2015. Form Without Matter: Empedocles and Aristotle on Color Perception. Oxford, UK: Oxford University Press.

Kandel, Eric R., James H. Schwartz, and Thomas M. Jessell, eds. 1991. Principles of Neural Science. Third. New York: Elsevier.

Kanizsa, Gaetano. 1979. Organization in Vision: Essays on Gestalt Perception. New York: Praeger Publishers.

Kanwisher, Nancy G., Josh McDermott, and Marvin M. Chun. 1997. “The Fusiform Face Area: A Module in Human Extrastriate Cortex Specialized for Face Perception.” The Journal of Neuroscience 17: 4302–11.

Karnieli, Asaf, Ohad Fried, and Yacov Hel-Or. 2022. “DeepShadow: Neural Shape from Shadow.” In Eccv, 415–30.

Karras, Tero, Miika Aittala, Samuli Laine, Erik Härkönen, Janne Hellsten, Jaakko Lehtinen, and Timo Aila. 2021. “Alias-Free Generative Adversarial Networks.” In Nips.

Kaufman, E L, and M W Lord. 1949. “The Discrimination of Visual Number.” The American Journal of Psychology 62 (4): 498–525.

Kearns, M., and A. Roth. 2020. The Ethical Algorithm: The Science of Socially Aware Algorithm Design. Oxford University Press.

Kendall, A., H. Martirosyan, S. Dasgupta, P. Henry, R. Kennedy, A. Bachrach, and A. Bry. 2017. “End-to-End Learning of Geometry and Context for Deep Stereo Regression.” In Proceedings of the IEEE/CVF International Conference on Computer Vision.

Kerbl, Bernhard, Georgios Kopanas, Thomas Leimkühler, and George Drettakis. 2023. “3D Gaussian Splatting for Real-Time Radiance Field Rendering.” Tog 42 (4): 1–14.

Kim, Taeksoo, Moonsu Cha, Hyunsoo Kim, Jung Kwon Lee, and Jiwon Kim. 2017. “Learning to Discover Cross-Domain Relations with Generative Adversarial Networks.” In Icml, 1857–65.

Kingdom, Frederick A. A., Ali Yoonessi, and Elena Gheorghiu. 2017. “The Leaning Tower Illusion.” In The Oxford Compendium of Visual Illusions. Oxford University Press.

Kingma, Diederik P., and Jimmy Ba. 2014. “Adam: A Method for Stochastic Optimization.” https://arxiv.org/abs/1412.6980.

Kingma, Diederik P, and Max Welling. 2014. “Auto-Encoding Variational Bayes.” Iclr.

Kingma, Diederik P, Max Welling, et al. 2019. “An Introduction to Variational Autoencoders.” Foundations and Trends in Machine Learning 12 (4): 307–92.

Kingma, Diederik, Tim Salimans, Ben Poole, and Jonathan Ho. 2021. “Variational Diffusion Models.” In Nips, 34:21696–707.

Kipf, T., and M. Welling. 2017. “Semi-Supervised Classification with Graph Convolutional Networks.” https://arxiv.org/abs/1609.02907.

Kirillov, Alexander, Eric Mintun, Nikhila Ravi, Hanzi Mao, Chloe Rolland, Laura Gustafson, Tete Xiao, et al. 2023. “Segment Anything.” In Iccv, 4015–26.

Klare, B. F., M. J. Burge, J. C. Klontz, R. W. Vorder Bruegge, and A. K. Jain. 2012. “Face Recognition Performance: Role of Demographic Information.” IEEE Trans. On Information Forensics and Security 7 (6): 1789–1801.

Knill, David C., Pascal Mamassian, and Daniel Kersten. 1997. “Geometry of Shadows.” Josa 14 (12): 3216–32.

Knuth, D. E., T. L. Larrabee, and P. M. Roberts. 1989. Mathematical Writing. Mathematical Association of America Notes.

Koch, C, and S Ullman. 1985. “Shifts in Selective Visual Attention: Towards the Underlying Neural Circuitry.” Human Neurobiology 4 (4): 219–27.

Koenderink, J. J. 1988. “Image Structure.” In Mathematics and Computer Science in Medical Imaging, edited by M. A. Viergever and A. E. Todd-Pokropek, 67–103. Berlin: Springer-Verlag.

———. 1990. Solid Shape. Cambridge, MA: MIT Press.

Koenderink, J. J., and A. J. van Doorn. 1987. “Representation of Local Geometry in the Visual System.” Biological Cybernetics 55: 367–75.

Koffka, Kurt. 1935. Principles of Gestalt Psychology. London: Routledge.

Koh, Pang Wei, Thao Nguyen, Yew Siang Tang, Stephen Mussmann, Emma Pierson, Been Kim, and Percy Liang. 2020. “Concept Bottleneck Models.” In Icml, 5338–48.

Koller, D., and N. Friedman, eds. 2009. Probabilistic Graphical Models. Cambridge MA: MIT Press.

Kolmogorov, V. 2006. “Convergent Tree-Reweighted Message Passing for Energy Minimization.” IEEE Transactions on Pattern Analysis and Machine Intelligence 28 (10).

Komodakis, Nikos, and Spyros Gidaris. 2018. “Unsupervised Representation Learning by Predicting Image Rotations.” In Iclr.

Krishna, Ranjay, Yuke Zhu, Oliver Groth, Justin Johnson, Kenji Hata, Joshua Kravitz, Stephanie Chen, et al. 2017. “Visual Genome: Connecting Language and Vision Using Crowdsourced Dense Image Annotations.” Ijcv 123 (1): 32--73.

Krizhevsky, Alex. 2009. “Learning Multiple Layers of Features from Tiny Images.” University of Toronto, Toronto.

Krizhevsky, Alex, Ilya Sutskever, and Geoffrey E Hinton. 2012. “Imagenet Classification with Deep Convolutional Neural Networks.” In Nips, 25:1097–1105.

Kuffler, Stephen W. 1953. “Discharge Patterns and Functional Organization of Mammalian Retina.” Journal of Neurophysiology 16 (1): 37–68.

Lamott, Anne. 1980. Bird by Bird. Bantam Doubleday Dell Publishing Group.

Land, E. H. 1983. “Recent Advances in Retinex Theory and Some Implications for Cortical Computations: Color Vision and the Natural Image.” Proceedings of the National Academy of Sciences of the USA 80: 5163–69.

Land, Edwin Herbert, and John J. McCann. 1971. “Lightness and Retinex Theory.” Journal of the Optical Society of America 61 1: 1–11.

Larsson, Gustav, Michael Maire, and Gregory Shakhnarovich. 2016. “Learning Representations for Automatic Colorization.” In Eccv, 577–93. Springer.

LeCun, Y. 2006. “A Tutorial on Energy-Based Learning.” http://www.cs.toronto.edu/~vnair/ciar/lecun1.pdf.

———. 2007. “Energy-Based Models: The Cure Against Bayesian Fundamentalism.” https://www.mit.edu/~9.520/spring07/Classes/lecun-20070502-mit.pdf.

LeCun, Yann, Bernhard Boser, John S Denker, Donnie Henderson, Richard E Howard, Wayne Hubbard, and Lawrence D Jackel. 1989. “Backpropagation Applied to Handwritten Zip Code Recognition.” Neural Computation 1 (4): 541–51.

Lepetit, V., and P. Fua. 2006. “Keypoint Recognition Using Randomized Trees.” Pami 28 (9): 1465–79.

Leung, T. K., M. C. Burl, and P. Perona. 1995. “Finding Faces in Cluttered Scenes Using Random Labeled Graph Matching.” In Iccv, 637–44.

Levin, A., Y. Weiss, F. Durand, and W. T. Freeman. 2011. “Efficient Marginal Likelihood Optimization in Blind Deconvolution.” In Proceedings of the IEEE/CVF Computer Vision and Pattern Recognition.

Levoy, Marc. 2010. “Optics i: Lenses and Apertures.” https://graphics.stanford.edu/courses/cs178-13/lectures/optics1-09apr13.pdf.

Levoy, Marc, and Pat Hanrahan. 1996. “Light Field Rendering.” In Siggraph, 31--42.

Li, Fei-Fei, Marco Andreeto, Marc’Aurelio Ranzato, and Pietro Perona. 2022. “Caltech 101.” CaltechDATA.

Li, Junnan, Dongxu Li, Silvio Savarese, and Steven Hoi. 2023. “BLIP-2: Bootstrapping Language-Image Pre-Training with Frozen Image Encoders and Large Language Models.” In Icml.

Li, Zhengqi, and Noah Snavely. 2018. “MegaDepth: Learning Single-View Depth Prediction from Internet Photos.” In Cvpr.

Lin, T., P. Dollar, R. Girshick, K. He, B. Hariharan, and S. Belongie. 2017. “Feature Pyramid Networks for Object Detection.” In Cvpr, 936–44.

Lin, Tsung-Yi, Michael Maire, Serge Belongie, Lubomir Bourdev, Ross Girshick, James Hays, Pietro Perona, Deva Ramanan, C. Lawrence Zitnick, and Piotr Dollár. 2014. “Microsoft COCO: Common Objects in Context.” In Eccv, 740–55.

Lindeberg, Tony. 1994. Scale-Space Theory in Computer Vision. Kluwer Academic Publishers.

Linsker, Ralph. 1988. “Self-Organization in a Perceptual Network.” Computer 21 (3): 105–17.

Lipson, L., Z. Teed, and J. Deng. 2021. “RAFT-Stereo: Multilevel Recurrent Field Transforms for Stereo Matching.” In International Conference on 3D Vision (3DV).

Liu, Ce, William T. Freeman, Edward H. Adelson, and Yair Weiss. 2008. “Human-Assisted Motion Annotation.” In Cvpr, 1–8.

Liu, Rosanne, Joel Lehman, Piero Molino, Felipe Petroski Such, Eric Frank, Alex Sergeev, and Jason Yosinski. 2018. “An Intriguing Failing of Convolutional Neural Networks and the Coordconv Solution.” https://arxiv.org/abs/1807.03247.

Livingstone, M S, and David H. Hubel. 1984. “Anatomy and Physiology of a Color System in the Primate Visual Cortex.” In The Journal of Neuroscience.

Long, Jonathan, Evan Shelhamer, and Trevor Darrell. 2015. “Fully Convolutional Networks for Semantic Segmentation.” In Cvpr, 3431–40.

Longuet-Higgens, H. C. 1981. “A Computer Algorithm for Reconstructing a Scene from Two Projections.” Nature 293: 133–35.

Loop, Charles, and Zhengyou Zhang. 1999. “Computing Rectifying Homographies for Stereo Vision.” In Cvpr, 1:125–31.

Lowe, D. G. 2004. “Distinctive Image Features from Scale-Invariant Keypoints.” International Journal of Computer Vision 60 (2): 91–110.

Lowe, David G. 1985. Perceptual Organization and Visual Recognition. Kluwer Academic Publishers.

Lucas, B. D., and T. Kanade. 1981. “An Iterative Image Registration Technique with an Application to Stereo Vision.” In Proceedings of Imaging Understanding Workshop, 121–30.

Maaten, Laurens van der, and Geoffrey Hinton. 2008. “Visualizing Data Using t-SNE.” Journal of Machine Learning Research 9 (Nov): 2579–2605.

MacKay, David J. C. 2003. Information Theory, Inference and Learning Algorithms. Cambridge University Press.

Madison, Cindee, William Thompson, Daniel Kersten, Peter Shirley, and Brian Smits. 2001. “Use of Interreflection and Shadow for Surface Contact.” Perception & Psychophysics 63 (2): 187–94.

Malik, J., and P. Perona. 1990. “Preattentive Texture Discrimination with Early Vision Mechanisms.” Journal of the Optical Society of America A 7: 923–31.

Malisiewicz, Tomasz, and Alexei A. Efros. 2009. “Beyond Categories: The Visual Memex Model for Reasoning about Object Relationships.” In Nips.

Marr, D. 1982. Vision: A Computational Investigation into the Human Representation and Processing of Visual Information. W. H. Freeman, San Francisco.

———. 2010. Vision. Cambridge, MA: MIT Press.

Marr, D. C., and E. Hildreth. 1980. “Theory of Edge Detection.” Proceedings of the Royal Society B 207: 187–217.

Martens, Rhonda. 2001. “Optics: Paralipomena to Witelo, and Optical Part of Astronomy. Johannes Kepler. Translated by William h. Donahue.” Isis 92 (3): 607–8.

Matas, Jiri, Ondrej Chum, Martin Urban, and Tomas Pajdla. 2004. “Robust Wide Baseline Stereo from Maximally Stable Extremal Regions.” Image and Vision Computing 22 (September): 761–67.

Matheron, G. 1975. Random Sets and Integral Geometry. Wiley.

Matusik, Wojciech, Hanspeter Pfister, Matt Brand, and Leonard McMillan. 2002. “Directional Reflectance and Emissivity of an Opaque Surface.” ACM Transactions on Graphics 22.

Max, Nelson. 1995. “Optical Models for Direct Volume Rendering.” IEEE Transactions on Visualization and Computer Graphics 1 (2): 99–108.

Max, Nelson, and Min Chen. 2005. “Local and Global Illumination in the Volume Rendering Integral.” Lawrence Livermore National Lab.(LLNL), Livermore, CA (United States).

Mayer, N., E. Ilg, P. Hausser, P. Fischer, D. Cremers abd A. Dosovitskiy, and T. Brox. 2016. “A Large Dataset to Train Convolutional Networks for Disparity, Optical Flow, and Scene Flow Estimation.” In Proceedings of the IEEE/CVF Computer Vision and Pattern Recognition.

McCann, J. J., S. P. McKee, and T. H. Taylor. 1976. “Quantitative Studies in Retinex Theory: A Comparison Between Theoretical Predictions and Observer Responses to the ’Color Mondrian’ Experiments.” Vision Research 16: 445–58.

McManus, Jim. 2022. “Real Titanic 3D.” http://www.realtitanic3d.com/.

Mead, Carver. 1989. Analog VLSI and Neural Systems. Boston: Addison-Wesley Longman Publishing.

Mermin, N. David. 1989. “What’s Wrong with These Equations?” Physics Today.

Mersereau, R. M. 1979. “The Processing of Hexagonally Sampled Two-Dimensional Signals.” Proceedings of the IEEE 67 (6): 930–49.

Mikolajczyk, Krystian, and Cordelia Schmid. 2002. “An Affine Invariant Interest Point Detector.” In Eccv, edited by Anders Heyden, Gunnar Sparr, Mads Nielsen, and Peter Johansen, 128–42.

Mikolajczyk, K., and C. Schmid. 2001. “Indexing Based on Scale Invariant Interest Points.” In Iccv, 2:525.

———. 2005. “A Performance Evaluation of Local Descriptors.” Pami 27 (10): 1615–30.

Mildenhall, Ben, Pratul P Srinivasan, Matthew Tancik, Jonathan T Barron, Ravi Ramamoorthi, and Ren Ng. 2020. “Nerf: Representing Scenes as Neural Radiance Fields for View Synthesis.” In Eccv, 405–21. Springer.

Minnaert, Marcel. 1993. Light and Color in the Outdoors. New York: Springer New York.

———. 2012. Light and Color in the Outdoors. New York: Springer.

Minsky, Marvin, and Seymour Papert. 1969. Perceptrons. Cambridge, MA: MIT Press.

Moghaddam, B., and A. P. Pentland. 1997. “Probabilistic Visual Learning for Object Representation.” IEEE Transactions on Pattern Analysis and Machine Intelligence 19 (7): 696–710.

Mozur, P. 2019. “One Month, 500,000 Face Scans.” New York Times.

Mullainathan, Sendhil. 2019. “Biased Algorithms Are Easier to Fix Than Biased People.” New York Times.

Mundy, Joseph. 2006. “Object Recognition in the Geometric Era: A Retrospective.” In Toward Category Level Object Recognition, 4170:3–28.

Murakami, I., A. Kitaoka, and H. Ashida. 2010. “Artificial Image Oscillation Enhances the Rotating Snakes Illusion.” Journal of Vision 6 (6): 551.

Murphy, Christopher J., and Howard C. Howland. 1986. “On the Gekko Pupil and Scheiner’s Disc.” Vision Research 26 (5): 815–17.

Murphy, Kevin P. 2022. Probabilistic Machine Learning: An Introduction. Cambridge, MA: MIT Press.

Murphy, Kevin, Yair Weiss, and Michael I. Jordan. 1999. “Loopy Belief Propagation for Approximate Inference: An Empirical Study.” In Proceedings of the Fifteenth Conference on Uncertainty in Artificial Intelligence, 467–75.

Nathan Silberman, Pushmeet Kohli, Derek Hoiem, and Rob Fergus. 2012. “Indoor Segmentation and Support Inference from RGBD Images.” In Eccv.

Navez, B. n.d. https://commons.wikimedia.org/w/index.php?curid=855487.

Nayar, Shree K., Katsushi Ikeuchi, and Takeo Kanade. 1991. “Shape from Interreflections.” Ijcv 6 (3): 173–95.

Necker, L. A. 2005. “Observations on Some Remarkable Optical Phaenomena Seen in Switzerland; and on an Optical Phaenomenon Which Occurs on Viewing a Figure of a Crystal or Geometrical Solid.” London and Edinburgh Philosophical Magazine and Journal of Science 5 (1): 329–37.

Nene, Samer A., Shree K. Nayar, and Hiroshi Murase. 1996. “Columbia Object Image Library (COIL-20).” Department of Computer Science, Columbia University.

Ng, Andrew Y., Michael I. Jordan, and Yair Weiss. 2001. “On Spectral Clustering: Analysis and an Algorithm.” In Nips, 849–56.

Nicodemus, Fred E. 1965. “Directional Reflectance and Emissivity of an Opaque Surface.” Applied Optics 4: 767–75.

Noble, Safiya Umoja. 2018. Algorithms of Oppression. NYU Press, Inc.

Olah, Chris, Alexander Mordvintsev, and Ludwig Schubert. 2017. “Feature Visualization.” Distill 2 (11): e7.

Oliva, A., and A. Torralba. 2001. “Modeling the Shape of the Scene: A Holistic Representation of the Spatial Envelope.” International Journal of Computer Vision 42(3): 145–75.

Oord, Aaron van den, Yazhe Li, and Oriol Vinyals. 2018. “Representation Learning with Contrastive Predictive Coding.” https://arxiv.org/abs/1807.03748.

OpenAI. 2023. “GPT-4 Technical Report.” https://arxiv.org/abs/2303.08774.

———. 2024. “GPT-4V(ision) System Card.”

Oppenheim, A. V., and J. S. Lim. 1981. “The Importance of Phase in Signals.” Proceedings of the IEEE 69 (5): 529–41.

Oppenheim, Alan V., Alan S. Willsky, and S. Hamid Nawab. 1996. Signals and Systems, 2nd Ed. Hoboken, NJ: Prentice-Hall.

Ordonez, Vicente, Girish Kulkarni, and Tamara Berg. 2011. “Im2text: Describing Images Using 1 Million Captioned Photographs.” In Nips. Vol. 24.

Oren, M., and S. K. Nayar. 1994. “Generalization of Lambert’s Reflection Model.” In ACM SIGGRAPH: Proceedings of the Annual Conference on Computer Graphics and Interactive Techniques, 239–46.

Orwell, George. 1948. 1984. Prabhat Prakashan.

Owen, Art B. 2013. Monte Carlo Theory, Methods and Examples. https://artowen.su.domains/mc/.

Palmer, Irvin. 1994. “Rethinking Perceptual Organization: The Role of Uniform Connectedness.” Psychonomic Bulletin & Review. 1 (1).

Palmer, S., E. Rosch, and P. Chase. 1981. “Canonical Perspective and the Perception of Objects.” International Symposium on Attention and Performance (Attention and Performance IX)., 135–51.

Palmer, Stephen E. 1999. Vision Science: Photons to Phenomenology. Cambridge, MA: MIT Press.

Pantone. 2020. “Munsell USDA Frozen French Fry Standard.” https://www.pantone.com/products/munsell/munsell-usda-frozen-french-fry-standard.

Papert, Seymour. 1966. “The Summer Vision Project.” MIT AI Memo 100. Massachusetts Institute of Technology, Project Mac.

Parikh, Devi, and Dhruv Batra. 2018. “CVPR18 Workshop Panel: How to Be a Good Citizen of the CVPR Community.”

Paris, Sylvain, Pierre Kornprobst, Jack Tumblin, and Frédo Durand. 2009. Bilateral Filtering: Theory and Applications. Now Publishers.

Parish, Yoav I. H., and Pascal Müller. 2001. “Procedural Modeling of Cities.” In Proceedings of the 28th Annual Conference on Computer Graphics and Interactive Techniques, 301–8.

Pathak, Deepak, Pulkit Agrawal, Alexei A Efros, and Trevor Darrell. 2017. “Curiosity-Driven Exploration by Self-Supervised Prediction.” In Icml, 2778–87.

Pathak, Deepak, Philipp Krahenbuhl, Jeff Donahue, Trevor Darrell, and Alexei A Efros. 2016. “Context Encoders: Feature Learning by Inpainting.” In Cvpr, 2536–44.

Pell, Denis G, Patrick Cavanagh, Robert Desimone, Bosco Tjan, and Anne Treisman. 2007. “Crowding: Including Illusory Conjunctions, Surround Suppression, and Attention.” Journal of Vision 7 (2): 1.

Pentland, A. P. 1990. “Linear Shape from Shading.” International Journal of Computer Vision 1 (4): 153–62.

Perona, P. 1995. “Deformable Kernels for Early Vision.” IEEE Transactions on Pattern Analysis and Machine Intelligence 17 (5): 488–99.

Perona, P., and J. Malik. 1990a. “Detecting and Localizing Edges Composed of Steps, Peaks and Roofs.” In Proceedings of the IEEE/CVF International Conference on Computer Vision.

———. 1990b. “Scale-Space and Edge Detection Using Anisotropic Diffusion.” IEEE Transactions on Pattern Analysis and Machine Intelligence 12 (7): 629–39.

Phong, Bui Tuong. 1975. “Illumination for Computer Generated Pictures.” Commun. ACM 18 (6): 311–17.

Photos, MN. n.d. http://www.flickr.com/photos/mnsomero/2738807250/.

Plato. 360 BCE. Translated by Benjamin Jowett. https://classics.mit.edu/Plato/timaeus.html.

Poggio, T., V. Torre, and C. Koch. 1985. “Computational Vision and Regularization Theory.” Nature 317 (26): 314–139.

Pollefeys, M., R. Koch, and L. Van Gool. 1999. “A Simple and Efficient Rectification Method for General Motion.” In Proceedings of the IEEE/CVF International Conference on Computer Vision, 496–501.

Portilla, J., and E. P. Simoncelli. 2000. “A Parametric Texture Model Based on Joint Statistics of Complex Wavelet Coefficients.” International Journal of Computer Vision 40 (1): 49–71.

Prince, S. J. D. 2012. Computer Vision: Models Learning and Inference. Cambridge University Press.

Radford, Alec, Jong Wook Kim, Chris Hallacy, Aditya Ramesh, Gabriel Goh, Sandhini Agarwal, Girish Sastry, et al. 2021. “Learning Transferable Visual Models from Natural Language Supervision.” In Icml, 8748–63.

Ramaswamy, Vikram V., William T. Freeman, Fei-Fei Li, Pietro Perona, Antonio Torralba, and Olga Russakovsky. 2021. “The Future of Computer Vision Datasets.” Computer Vision and Pattern Recognition Workshop.

Ramaswamy, Vikram V., Sunnie S. Y. Kim, and Olga Russakovsky. 2021. “Fair Attribute Classification Through Latent Space de-Biasing.” In Proceedings of the IEEE/CVF Computer Vision and Pattern Recognition.

Ramesh, Aditya, Prafulla Dhariwal, Alex Nichol, Casey Chu, and Mark Chen. 2022. “Hierarchical Text-Conditional Image Generation with Clip Latents.” https://arxiv.org/abs/2204.06125.

Ramesh, Aditya, Mikhail Pavlov, Gabriel Goh, Scott Gray, Chelsea Voss, Alec Radford, Mark Chen, and Ilya Sutskever. 2021. “Zero-Shot Text-to-Image Generation.” In Icml, 8821–31.

Ramón y Cajal, S. 1893. “La Rétine Des Vertébrés.” Cellule 9: 119–255.

Ranftl, René, Alexey Bochkovskiy, and Vladlen Koltun. 2021. “Vision Transformers for Dense Prediction.” In Iccv.

Ranftl, René, Katrin Lasinger, David Hafner, Konrad Schindler, and Vladlen Koltun. 2022. “Towards Robust Monocular Depth Estimation: Mixing Datasets for Zero-Shot Cross-Dataset Transfer.” Pami 44 (3).

Recasens, Adrià, Pauline Luc, Jean-Baptiste Alayrac, Luyu Wang, Florian Strub, Corentin Tallec, Mateusz Malinowski, et al. 2021. “Broaden Your Views for Self-Supervised Video Learning.” Iccv.

Redmon, J., S. Divvala, R. Girshick, and A. Farhadi. 2016. “You Only Look Once: Unified, Real-Time Object Detection.” In Cvpr, 779–88.

Ren, Shaoqing, Kaiming He, Ross B. Girshick, and Jian Sun. 2015. “Faster r-CNN: Towards Real-Time Object Detection with Region Proposal Networks.” In Nips, 91–99.

Rescorla, Robert A. 1972. “A Theory of Pavlovian Conditioning: Variations in the Effectiveness of Reinforcement and Non-Reinforcement.” Classical Conditioning, Current Research and Theory 2: 64–69.

Roberts, Lawrence G. 1963. Machine Perception of Three-Dimensional Solids. Outstanding Dissertations in the Computer Sciences. New York: Garland Publishing.

Rodríguez-Muñoz, Adrián, and Antonio Torralba. 2022. “Aliasing Is a Driver of Adversarial Attacks.” https://arxiv.org/abs/2212.11760.

Rogaway, P. 2015. “The Moral Character of Cryptographic Work?” International Association for Cryptologic Research.

Rombach, Robin, Andreas Blattmann, Dominik Lorenz, Patrick Esser, and Björn Ommer. 2022. “High-Resolution Image Synthesis with Latent Diffusion Models.” In Cvpr, 10684–95.

Ronneberger, Olaf, Philipp Fischer, and Thomas Brox. 2015. “U-Net: Convolutional Networks for Biomedical Image Segmentation.” In International Conference on Medical Image Computing and Computer-Assisted Intervention, 234–41. Springer.

Rosch, Eleanor. 1978. Principles of Categorization. De Gruyter Mouton.

Rosch, Eleanor, Carolyn B. Mervis, Wayne D. Gray, D M Johnson, and Penny Boyes-Braem. 1976. “Basic Objects in Natural Categories.” Cognitive Psychology 8: 382–439.

Rosenblatt, Frank. 1958. “The Perceptron: A Probabilistic Model for Information Storage and Organization in the Brain.” Psychological Review 65 (6): 386.

Rowley, Henry, Shumeet Baluja, and Takeo Kanade. 1996. “Neural Network-Based Face Detection.”

Rublee, Ethan, Vincent Rabaud, Kurt Konolige, and Gary Bradski. 2011. “ORB: An Efficient Alternative to SIFT or SURF.” In Iccv, 2564–71.

Ruderman, D. L. 1997. “Origins of Scaling in Natural Images.” Vision Research 37 (23): 3385–98.

Rumelhart, D. E., and J. L. McClelland, eds. 1986. Parallel Distributed Processing. Cambridge, MA: MIT Press.

Rumelhart, David E., Geoffrey E. Hinton, and Ronald J. Williams. 1985. “Learning Internal Representations by Error Propagation.” California Univ San Diego Inst for Cognitive Science.

Russakovsky, Olga, Jia Deng, Hao Su, Jonathan Krause, Sanjeev Satheesh, Sean Ma, Zhiheng Huang, et al. 2015. “Imagenet Large Scale Visual Recognition Challenge.” Ijcv 115 (3): 211–52.

Russell, B. C., A. Torralba, K. P. Murphy, and W. T. Freeman. 2008. “LabelMe: A Database and Web-Based Tool for Image Annotation.” International Journal of Computer Vision 77: 157–73.

Salimans, Tim, Jonathan Ho, Xi Chen, Szymon Sidor, and Ilya Sutskever. 2017. “Evolution Strategies as a Scalable Alternative to Reinforcement Learning.” https://arxiv.org/abs/1703.03864.

Sattigeri, Prasanna, Samuel C. Hoffman, Vijil Chenthamarakshan, and Kush R. Varshney. 2019. “Fairness GAN: Generating Datasets with Fairness Properties Using a Generative Adversarial Network.” In Intl. Conf. On Learning Representations (ICLR) Workshop.

Savinov, N., A. Seki, L. Ladicky, T. Sattler, and M. Pollefeys. 2017. “Quad-Networks: Unsupervised Learning to Rank for Interest Point Detection.” In Cvpr, 3929–37.

Saxena, Ashutosh, Sung H. Chung, and Andrew Y. Ng. 2008. “3-d Depth Reconstruction from a Single Still Image.” Ijcv 76 (1): 53–69.

Scharstein, D., and R. Szeliski. 2002. “A Taxonomy and Evaluation of Dense Two-Frame Stereo Correspondence Algorithms.” International Journal of Computer Vision 47.

Schmid, Cordelia, Roger Mohr, and Christian Bauckhage. 2000. “Evaluation of Interest Point Detectors.” Ijcv 37 (2): 151–72.

Schölkopf, Bernhard, Alexander Smola, and Klaus-Robert Müller. 1998. “Nonlinear Component Analysis as a Kernel Eigenvalue Problem.” Neural Computation 10 (5): 1299–319.

Schrimpf, Martin, Jonas Kubilius, Michael J Lee, N Apurva Ratan Murty, Robert Ajemian, and James J DiCarlo. 2020. “Integrative Benchmarking to Advance Neurally Mechanistic Models of Human Intelligence.” Neuron.

Shannon, C. E. 1948. “A Mathematical Theory of Communication.” The Bell System Technical Journal 27 (3): 379–423.

Shapin, S. 2019. “A Theorist of (Not Quite) Everything.” The New York Review of Books.

Shepard, Roger N. 1990. Mind Sights: Original Visual Illusions, Ambiguities, and Other Anomalies, with a Commentary on the Play of Mind in Perception and Art. New York: W.H. Freeman; Co.

Shepard, Roger N., and Jacqueline Metzler. 1971. “Mental Rotation of Three-Dimensional Objects.” Science 171 (3972): 701–3.

Sherrington, C S. 1906. “Observations on the Scratch-Reflex in the Spinal Dog.” The Journal of Physiology 34 (1-2): 1–50.

Shi, Jianbo, and Jitendra Malik. 2000. “Normalized Cuts and Image Segmentation.” Pami 22 (8): 888–905.

Shi, Jianbo, and Tomasi. 1994. “Good Features to Track.” In Cvpr, 593–600.

Shi, J., and J. Malik. 2000. “Normalized Cuts and Image Segmentation.” IEEE Transactions on Pattern Analysis and Machine Intelligence 22 (8): 888–905.

Shirley, Peter, Michael Ashikhmin, and Steve Marschner. 2009. Fundamentals of Computer Graphics. AK Peters/CRC Press.

Silver, David, Aja Huang, Chris J Maddison, Arthur Guez, Laurent Sifre, George Van Den Driessche, Julian Schrittwieser, et al. 2016. “Mastering the Game of Go Ith Deep Neural Networks and Tree Search.” Nature 529 (7587): 484–89.

Simoncelli, E. P. 2005. “Statistical Modeling of Photographic Images.” In Handbook of Image and Video Processing, 431–41. Academic Press.

Simoncelli, E. P., and E. H. Adelson. 1996. “Noise Removal via Bayesian Wavelet Coring.” In International Conference on Image Processing, 379–82.

Simoncelli, E. P., and W. T. Freeman. 1995. “The Steerable Pyramid: A Flexible Architecture for Multi-Scale Derivative Computation.” In International Conference on Image Processing.

Simoncelli, E. P., W. T. Freeman, E. H. Adelson, and D. J. Heeger. 1992. “Shiftable Multi-Scale Transforms.” IEEE Transactions on Information Theory 2 (38): 587–607.

Simoncelli, Eero P., and Edward H. Adelson. 1990. “Subband Image Coding with Hexagonal Quadrature Mirror Filters.” In Picture Coding Symposium.

Simonyan, Karen, and Andrew Zisserman. 2015. “Very Deep Convolutional Networks for Large-Scale Image Recognition.” In Iclr.

Sitzmann, Vincent, Julien N. P. Martel, Alexander W. Bergman, David B. Lindell, and Gordon Wetzstein. 2020. “Implicit Neural Representations with Periodic Activation Functions.” In Nips.

Smith, A. M. 2001. Alhacen’s Theory of Visual Perception: A Critical Edition, with English Translation and Commentary, of the First Three Books of Alhacen’s de Aspectibus, the Medieval Latin Version of Ibn Al-Haytham’s Kitab Al-Manazir. v. 91, pt. 4. American Philosophical Society.

Snavely, Noah, Steven M. Seitz, and Richard Szeliski. 2006. “Photo Tourism: Exploring Photo Collections in 3D.” In Siggraph, 835–46.

Snell, Jake, Karl Ridgeway, Renjie Liao, Brett D Roads, Michael C Mozer, and Richard S Zemel. 2017. “Learning to Generate Images with Perceptual Similarity Metrics.” In Icip, 4277–81. IEEE.

Soatto, Stefano. 2013. “Actionable Information in Vision.” In Machine Learning for Computer Vision, 17–48. Springer.

Sohl-Dickstein, Jascha, Eric Weiss, Niru Maheswaranathan, and Surya Ganguli. 2015. “Deep Unsupervised Learning Using Nonequilibrium Thermodynamics.” In Icml, 2256–65.

Sperling, George, Son-Hee Lyu, Chia-Huei Tseng, and Zhong-Lin Lu. 2017. “The Motion Standstill Illusion.” In The Oxford Compendium of Visual Illusions. Oxford University Press.

Spillmann, Lothar. 2014. “Receptive Fields of Visual Neurons: The Early Years.” Perception 43: 1145–76.

Srivastava, Rupesh Kumar, Klaus Greff, and Jürgen Schmidhuber. 2015. “Highway Networks.” https://arxiv.org/abs/1505.00387.

Steinman, Robert M, Zygmunt Pizlo, and Filip J Pizlo. 2000. “Phi Is Not Beta, and Why Wertheimer’s Discovery Launched the Gestalt Revolution.” Vision Research 40 (17): 2257–64.

Strunk, William, and E. B. White. 1999. The Elements of Style. Boston: Allyn; Bacon.

Sturm, Peter, and Bill Triggs. 1996. “A Factorization Based Algorithm for Multi-Image Projective Structure and Motion.” In Eccv, 709–20.

Surís, Dídac, Sachit Menon, and Carl Vondrick. 2023. “ViperGPT: Visual Inference via Python Execution for Reasoning.” In Iccv.

Sutskever, Ilya. n.d. https://twitter.com/ilyasut/status/1114658175272095744?s=20.

Sutton, Richard S, and Andrew G Barto. 2018. Reinforcement Learning: An Introduction. Cambridge, MA: MIT press.

Szegedy, Christian, Wojciech Zaremba, Ilya Sutskever, Joan Bruna, Dumitru Erhan, Ian Goodfellow, and Rob Fergus. 2014. “Intriguing Properties of Neural Networks.” In Iclr.

Szeliski, Richard. 2022. Computer Vision Algorithms and Applications. 2nd ed. Springer.

Tancik, Matthew, Pratul Srinivasan, Ben Mildenhall, Sara Fridovich-Keil, Nithin Raghavan, Utkarsh Singhal, Ravi Ramamoorthi, Jonathan Barron, and Ren Ng. 2020. “Fourier Features Let Networks Learn High Frequency Functions in Low Dimensional Domains.” In Nips, 33:7537–47.

Tarr, Michael J., and Steven Pinker. 1989. “Mental Rotation and Orientation-Dependence in Shape Recognition.” Cognitive Psychology 21: 233–82.

Telgarsky, Matus. 2016. “Benefits of Depth in Neural Networks.” In Conference on Learning Theory, 1517–39. PMLR.

Thomson, Judith Jarvis. 1985. “The Trolley Problem.” The Yale Law Journal 94 (6): 1395–415.

Tieu, K., and P. Viola. 2000. “Boosting Image Retrieval.” In Proceedings of the IEEE/CVF Computer Vision and Pattern Recognition.

Tomasi, C., and T. Kanade. 1992. “Shape and Motion from Image Streams Under Orthography: A Factorization Method.” International Journal of Computer Vision 9 (2): 137–54.

Torralba, A., and A. Efros. 2011. “Unbiased Look at Dataset Bias.” In Proceedings of the IEEE/CVF Computer Vision and Pattern Recognition.

Torralba, Antonio. 2009. “How Many Pixels Make an Image?” Visual Neuroscience 26 (1): 123–31.

Torralba, Antonio, Rob Fergus, and William T. Freeman. 2008. “80 Million Tiny Images: A Large Data Set for Nonparametric Object and Scene Recognition.” Pami 30 (11): 1958–70.

Torralba, Antonio, and William T. Freeman. 2014. “Accidental Pinhole and Pinspeck Cameras.” Ijcv 110 (2): 92–112.

Traer, James, and Josh H. McDermott. 2016. “Statistics of Natural Reverberation Enable Perceptual Separation of Sound and Space.” Proceedings of the National Academy of Sciences 113 (48): E7856–65.

Treisman, A M, and G Gelade. 1980. “A Feature-Integration Theory of Attention.” Cognit Psychol 12 (1): 97–136.

Trucco, Emanuele, and Alessandro Verri. 1998. Introductory Techniques for 3-d Computer Vision. USA: Prentice Hall PTR.

Turing, Alan M. 2009. Computing Machinery and Intelligence. Springer.

Tyleček, Radim, and Radim Šára. 2013. “Spatial Pattern Templates for Recognition of Objects with Regular Structure.” In Proceedings German Conference on Pattern Recognition. Saarbrucken, Germany.

Tzeng, Eric, Judy Hoffman, Kate Saenko, and Trevor Darrell. 2017. “Adversarial Discriminative Domain Adaptation.” In Cvpr, 7167–76.

Uijlings, Jasper R. R., Koen E. A. van de Sande, Theo Gevers, and Arnold W. M. Smeulders. 2013. “Selective Search for Object Recognition.” Ijcv 104 (2): 154–71.

Ullman, S., and Sydney Brenner. 1979. “The Interpretation of Structure from Motion.” Proceedings of the Royal Society of London. Series B. Biological Sciences 203 (1153): 405–26.

Ullman, Shimon. 2000. High-Level Vision. Cambridge, MA: MIT Press.

van der Schaaf, A., and J. H. van Hateren. 1996. “Modelling the Power Spectra of Natural Images: Statistics and Information.” Vision Research 36 (17): 2759–70.

Vanessaezekowitz. 1993. “Eye Cone Responses.” https://commons.wikimedia.org/wiki/File:Cones_SMJ2_E.svg.

Vaswani, Ashish, Noam Shazeer, Niki Parmar, Jakob Uszkoreit, Llion Jones, Aidan N Gomez, Łukasz Kaiser, and Illia Polosukhin. 2017. “Attention Is All You Need.” In Nips, 5998–6008.

Vincent, Pascal, Hugo Larochelle, Yoshua Bengio, and Pierre-Antoine Manzagol. 2008. “Extracting and Composing Robust Features with Denoising Autoencoders.” In Icml, 1096–1103.

Viola, P., and M. Jones. 2001. “Rapid Object Detection Using a Boosted Cascade of Simple Classifiers.” In Proceedings of the IEEE/CVF Computer Vision and Pattern Recognition.

Waltz, David L. 1972. “Generating Semantic Descriptions from Drawings of Scenes with Shadows.” PhD Thesis, Artificial Intelligence Lab Memo. Massachusetts Institute of Technology.

Wandell, Brian. 1995. Foundations of Vision. Sinauer Assoc.

Wang, Dequan, Evan Shelhamer, Shaoteng Liu, Bruno Olshausen, and Trevor Darrell. 2020. “Fully Test-Time Adaptation by Entropy Minimization.” https://arxiv.org/abs/2006.10726.

Wang, J. Y. A., and E. H. Adelson. 1994. “Representing Moving Images with Layers.” IEEE Transactions on Image Processing 3 (5): 625–38.

Wang, Tongzhou, and Phillip Isola. 2020. “Understanding Contrastive Representation Learning Through Alignment and Uniformity on the Hypersphere.” In Icml, 9929–39.

Wang, Zeyu, Klint Qinami, Ioannis Christos Karakozis, Kyle Genova, Prem Nair, Kenji Hata, and Olga Russakovsky. 2020. “Towards Fairness in Visual Recognition: Effective Strategies for Bias Mitigation.” In Proceedings of the IEEE/CVF Computer Vision and Pattern Recognition.

Weber, M., M. Welling, and P. Perona. 2000. “Towards Automatic Discovery of Object Categories.” In Proceedings of the IEEE/CVF Computer Vision and Pattern Recognition, 2:101–8.

Wei, Xi, Tianzhu Zhang, Yan Li, Yongdong Zhang, and Feng Wu. 2020. “Multi-Modality Cross Attention Network for Image and Sentence Matching.” In Cvpr, 10941–50.

Weiss, Yair. 2001. “Deriving Intrinsic Images from Image Sequences.” In Iccv, 2:68–75.

Weiss, Yair, and Edward H. Adelson. 1998. “Slow and Smooth: A Bayesian Theory for the Combination of Local Motion Signals in Human Vision.” MIT.

Wertheimer, Max. 1912. “Experimentelle Studien Uber Das Sehen von Bewegung.” Zeitschrift Fur Psychologie 61.

Wiesel, T. N. 1982. “The Postnatal Development of the Visual Cortex and the Influence of Environment.” Nature 299: 583–91.

Wikipedia. 2021b. https://en.wikipedia.org/wiki/Image_rectification.

———. 2021a. https://en.wikipedia.org/wiki/Hermann_von_Helmholtz.

Williams, Lance. 1983. “Pyramidal Parametrics.” In Siggraph, 1–11.

Winston, Patrick. 2016. “How to Speak.” https://vimeo.com/101543862.

Witkin, A. P. 1981. “Recovering Surface Shape and Orientation from Texture.” Artificial Intelligence 17: 17–45.

Wolfe, Jeremy M. 2000. “Visual Attention.” Seeing, 335–86.

———. 2007. “Guided Search 4.0: Current Progress with a Model of Visual Search.” In Integrated Models of Cognitive Systems. Oxford University Press.

Wu, Chenfei, Shengming Yin, Weizhen Qi, Xiaodong Wang, Zecheng Tang, and Nan Duan. 2023. “Visual ChatGPT: Talking, Drawing and Editing with Visual Foundation Models.” https://arxiv.org/abs/2303.04671.

Wu, Fa-Yueh. 1982. “The Potts Model.” Rev. Mod. Phys. 54 (1): 235–68.

Xian, Ke, Chunhua Shen, Zhiguo Cao, Hao Lu, Yang Xiao, Ruibo Li, and Zhenbo Luo. 2018. “Monocular Relative Depth Perception with Web Stereo Data Supervision.” In Cvpr, 311–20.

Yedidia, J. S., W. T. Freeman, and Y. Weiss. 2001. “Generalized Belief Propagation.” In Nips, 13:689–95.

Yi, Zili, Hao Zhang, Ping Tan, and Minglun Gong. 2017. “Dualgan: Unsupervised Dual Learning for Image-to-Image Translation.” In Iccv, 2849–57.

Zabih, R., and V. Komogorov. 2004. “What Energy Functions Can Be Minimized via Graph Cuts?” In European Conf. Computer Vision, 26:147–59.

Zbontar, J., and Y. LeCun. 2015. “Computing the Stereo Matching Cost with a Convolutional Neural Network.” In Proceedings of the IEEE/CVF Computer Vision and Pattern Recognition, 1592–99.

Zhang, Chiyuan, Samy Bengio, Moritz Hardt, Benjamin Recht, and Oriol Vinyals. 2016. “Understanding Deep Learning Requires Rethinking Generalization.” https://arxiv.org/abs/1611.03530.

Zhang, Feihu, Victor Prisacariu, Ruigang Yang, and Philip HS Torr. 2019. “GA-Net: Guided Aggregation Net for End-to-End Stereo Matching.” In Proceedings of the IEEE/CVF Computer Vision and Pattern Recognition, 185–94.

Zhang, Feihu, Xiaojuan Qi, Ruigang Yang, Victor Prisacariu, Benjamin Wah, and Philip Torr. 2019. “Domain-Invariant Stereo Matching Networks.” In European Conference on Computer Vision.

Zhang, L., B. Curless, A. Hertzmann, and S. M. Seitz. 2003. “Shape and Motion Under Varying Illumination: Unifying Structure from Motion, Photometric Stereo, and Multi-View Stereo.” In Proceedings of the IEEE/CVF International Conference on Computer Vision.

Zhang, Lvmin, Anyi Rao, and Maneesh Agrawala. 2023. “Adding Conditional Control to Text-to-Image Diffusion Models.” In Iccv.

Zhang, Richard, Phillip Isola, and Alexei A Efros. 2016. “Colorful Image Colorization.” In Eccv, 649–66. Springer.

———. 2017. “Split-Brain Autoencoders: Unsupervised Learning by Cross-Channel Prediction.” In Cvpr, 1058–67.

Zhang, Z. 2000. “A Flexible New Technique for Camera Calibration.” Pami 22 (11): 1330–34.

Zhao, Jieyu, Tianlu Wang, Mark Yatskar, Vicente Ordonez, and Kai-Wei Chang. 2017. “Men Also Like Shopping: Reducing Gender Bias Amplification Using Corpus-Level Constraints.” In Proceedings of the Conference on Empirical Methods in Natural Language Processing (EMNLP).

Zhou, Bolei, Aditya Khosla, Agata Lapedriza, Aude Oliva, and Antonio Torralba. 2015. “Object Detectors Emerge in Deep Scene CNNs.” Iclr.

Zhou, Tinghui, Matthew Brown, Noah Snavely, and David G. Lowe. 2017. “Unsupervised Learning of Depth and Ego-Motion from Video.” In Cvpr, 6612–19.

Zhou, Yichao, Haozhi Qi, Jingwei Huang, and Yi Ma. 2019. “NeurVPS: Neural Vanishing Point Scanning via Conic Convolution.” In Nips.

Zhu, Jun-Yan, Taesung Park, Phillip Isola, and Alexei A Efros. 2017. “Unpaired Image-to-Image Translation Using Cycle-Consistent Adversarial Networks.” In Iccv.

Other Links