References
Abbas, Syed Ammar, and Andrew Zisserman. 2019. “A Geometric
Approach to Obtain a Bird’s Eye View from an Image.” In 2019
IEEE/CVF International Conference on Computer Vision Workshop
(ICCVW), 4095–4104.
Abbott, Edwin. 2009. Flatland. Broadview Press.
Acdx, user. 2009. “CIE 1931 XYZ Color Matching Functions.”
Adelson, E. H. 1992.
———. 1995.
———. 2000. “Lightness Perception and Lightness Illusions.”
In The New Cognitive Neurosciences, edited by M. Gazzaniga,
339–51. Cambridge, MA: MIT Press.
———. 2001. “On Seeing Stuff: The Perception of Materials by Humans
and Machines.” Proceedings of SPIE 4299 (June). https://doi.org/10.1117/12.429489.
Adelson, E. H., and J. R. Bergen. 1985. “Spatiotemporal Energy
Models for the Perception of Motion.” Journal of the Optical
Society of America A 2 (2): 284–99.
Adelson, E. H., and E. P. Simoncelli. 1987. “QMF
Pyramids: A New Class of Orthogonal Pyramid
Transform.” In Optical Society of America, Annual
Meeting. Vol. A4–13.
Adelson, Edward H. 1995. “Checkershadow Illusion.”
Adelson, Edward H., and James R. Bergen. 1991. “The Plenoptic
Function and the Elements of Early Vision.” In Computational
Models of Visual Processing., edited by Michael S. Landy and
Anthony J. Movshon, 3–20. Cambridge, MA: MIT Press.
Alayrac, Jean-Baptiste, Jeff Donahue, Pauline Luc, Antoine Miech, Iain
Barr, Yana Hasson, Karel Lenc, et al. 2022. “Flamingo: A Visual
Language Model for Few-Shot Learning.” In Nips,
35:23716–36.
Alda, Alan. 2014. “Alan Alda on Improvisation for Communication of
Science.” https://www.youtube.com/watch?v=j4XgjkXDxss.
Amarasinghe, Saman, and Deanna Montgomery. 2022. “Faculty Job
Talks: Tips from the Faculty.” https://www.eecs.mit.edu/career-opportunities-at-eecs/faculty-job-talks-tips-from-the-faculty/.
Andrychowicz, Marcin, Misha Denil, Sergio Gomez, Matthew W Hoffman,
David Pfau, Tom Schaul, Brendan Shillingford, and Nando De Freitas.
2016. “Learning to Learn by Gradient Descent by Gradient
Descent.” In Nips. Vol. 29.
Antol, Stanislaw, Aishwarya Agrawal, Jiasen Lu, Margaret Mitchell, Dhruv
Batra, C Lawrence Zitnick, and Devi Parikh. 2015. “VQA: Visual
Question Answering.” In Iccv, 2425–33.
Arbelaez, Pablo, Michael Maire, Charless Fowlkes, and Jitendra Malik.
2010. “Contour Detection and Hierarchical Image
Segmentation.” Pami 33 (5): 898–916.
Ariely, Dan. 2001. “Seeing Sets: Representation by Statistical
Properties.” Psychological Science 12 (2): 157–62.
Aristotle. 350 BC. On Sense and the Sensible. http://classics.mit.edu/Aristotle/sense.html.
Arnold, S. E. J., S. Faruq, V. Savolainen, P. W. McOwan, and L. Chittka.
2011. “FReD: The Floral Reflectance Database– a Web Portal for
Analysis of Flower Colour.” PLoS ONE 5 (12): e14287.
Atick, Joseph J., and A. Norman Redlich. 1990. “Towards a Theory
of Early Visual Processing.” Neural Computation 2 (3):
308–20.
———. 1992. “What Does the Retina Know about Natural
Scenes?” Neural Computation 4 (2): 196–210.
Avni, Amir. 2016. http://www.whatimade.today/our-frst-reddit-bot-coloring-b-2/.
Azizi, Shekoofeh, Simon Kornblith, Chitwan Saharia, Mohammad Norouzi,
and David J Fleet. 2023. “Synthetic Data from Diffusion Models
Improves Imagenet Classification.” https://arxiv.org/abs/2304.08466.
Ba, Jimmy Lei, Jamie Ryan Kiros, and Geoffrey E. Hinton. 2016.
“Layer Normalization.” https://arxiv.org/abs/1607.06450.
Badrinarayanan, Vijay, Ankur Handa, and Roberto Cipolla. 2015.
“SegNet: A Deep Convolutional Encoder-Decoder Architecture for
Robust Semantic Pixel-Wise Labelling.” Pami 39: 2481–95.
Bahng, Hyojin, Ali Jahanian, Swami Sankaranarayanan, and Phillip Isola.
2022. “Exploring Visual Prompts for Adapting Large-Scale
Models.” https://arxiv.org/abs/2203.17274.
Baker, Simon, Stefan Roth, Daniel Scharstein, Michael J. Black, J. P.
Lewis, and Richard Szeliski. 2007. “A Database and Evaluation
Methodology for Optical Flow.” In Proceedings of the IEEE/CVF
International Conference on Computer Vision, 1–8.
Balakrishnan, G., Y. Xiong, W. Xia, and P. Perona. 2020. “Towards
Causal Benchmarking of Bias in Face Analysis Algorithms.” In
European Conference on Computer Vision.
Ballard, Dana H. 1987. “Modular Learning in Neural
Networks.” In Aaai, 647:279–84.
Bansal, Arpit, Eitan Borgnia, Hong-Min Chu, Jie S Li, Hamid Kazemi,
Furong Huang, Micah Goldblum, Jonas Geiping, and Tom Goldstein. 2022.
“Cold Diffusion: Inverting Arbitrary Image Transforms Without
Noise.” https://arxiv.org/abs/2208.09392.
Bar, Amir, Yossi Gandelsman, Trevor Darrell, Amir Globerson, and Alexei
Efros. 2022. “Visual Prompting via Image Inpainting.” In
Nips, 35:25005–17.
Barber, G. 2019. “The Viral App That Labels You Isn’t Quite What
You Think.” Wired.
Barnes, C., E. Shechtman, A. Finkelstein, and D. B. Goldman. 2009.
“PatchMatch: A Randomized Correspondence Algorithm for Structural
Image Editing.” In ACM SIGGRAPH: Proceedings of the Annual
Conference on Computer Graphics and Interactive Techniques.
Barocas, Solon, Moritz Hardt, and Arvind Narayanan. 2019. Fairness
and Machine Learning. fairmlbook.org.
Barron, Jonathan T. 2015. “Convolutional Color Constancy.”
In Proceedings of the IEEE/CVF International Conference on Computer
Vision.
Barron, Jonathan T., and Jitendra Malik. 2015. “Shape,
Illumination, and Reflectance from Shading.” Pami 37:
1670–87.
Barrow, H. G., and J. M. Tenenbaum. 1978. “Recovering Intrinsic
Scene Characteristics from Images.” In Computer Vision
Systems, edited by A. R. Hanson and E. M. Riseman, 3–26. New York:
Academic Press.
Baudes, A., B. Coll, and J.-M. Morel. 2011. “Non-Local Means
Denoising.” In Image Processing on Line. Vol. 1.
Bay, H., T. Tuytelaars, and L. Van Gool. 2006. “SURF: Speeded up
Robust Features.” In Eccv, 404–17.
Belkin, Mikhail, Daniel Hsu, Siyuan Ma, and Soumik Mandal. 2018.
“Reconciling Modern Machine Learning and the Bias-Variance
Trade-Off.” https://arxiv.org/abs/1812.11118.
Bengio, Yoshua, Aaron Courville, and Pascal Vincent. 2013.
“Representation Learning: A Review and New Perspectives.”
Pami 35 (8): 1798–828.
Benjamin, Ruha. 2019. Race After Technology. Polity.
Bennett, Cynthia L., Cole Gleason, Morgan Klaus Scheuerman, Jeffrey P.
Bigham, Anhong Guo, and Alexandra To. 2021. “’It’s Complicated’:
Negotiating Accessibility and (Mis)representation in Image Descriptions
of Race, Gender, and Disability.” In CHI 2021.
Bergen, J. R., and E. H. Adelson. 1988. “Visual Texture
Segmentation and Early Vision.” Nature 333: 363–64.
Biederman, I. 1987. “Recognition by Components - a Theory of Human
Image Understanding.” Psychological Review 94 (2).
Biederman, Irving. 1976. “On Processing Information from a Glance
at a Scene: Some Implications for a Syntax and Semantics of Visual
Processing.” In Proceedings of the ACM/SIGGRAPH Workshop on
User-Oriented Design of Interactive Graphics Systems, 75–88.
Binford, Thomas O. 1971. “Visual Perception by Computer.”
In Proceedings of the IEEE Conference on Systems and
Control (Miami, FL).
Birhane, A., and V. U. Prabhu. 2021. “Large Image Datasets: A
Pyrrhic Win for Computer Vision?” In IEEE/CVF Winter
Conference on Applications of Computer Vision.
Bishop, C. M. 2006. Pattern Recognition and Machine Learning.
Springer-Verlag.
Blake, A., P. Kohli, and C. Rother. 2011. Markov Random Fields for
Vision and Image Processing. Cambridge, MA: MIT Press.
Bleasdale, Cecilia. 2015. https://en.wikipedia.org/wiki/The_dress.
Bouman, Katherine L., Vickie Ye, Adam B. Yedidia, Fredo Durand, Gregory
W. Wornell, Antonio Torralba, and William T. Freeman. 2018.
“Turning Corners into Cameras: Principles and Methods.” In
Iccv.
Bourlard, Hervé, and Yves Kamp. 1988. “Auto-Association by
Multilayer Perceptrons and Singular Value Decomposition.”
Biological Cybernetics 59 (4): 291–94.
Boyer, Carl B. 1946. “Aristotelian References to the Law of
Reflection.” Isis 36 (2): 92–95.
Brainard, D. H., and W. T. Freeman. 1997. “Bayesian Color
Constancy.” Journal of the Optical Society of America A
14 (7): 1393–1411.
Brainard, David H., and Anya C. Hurlbert. 2015. “Colour Vision:
Understanding #TheDress.” Current Biology 25: R551–54.
Brock, Andrew, Jeff Donahue, and Karen Simonyan. 2019. “Large
Scale GAN Training for High Fidelity Natural Image Synthesis.”
International Conference on Learning Representations.
Brooks, Tim, Aleksander Holynski, and Alexei A Efros. 2023.
“Instructpix2pix: Learning to Follow Image Editing
Instructions.” In Cvpr.
Brown, Tom, Benjamin Mann, Nick Ryder, Melanie Subbiah, Jared D Kaplan,
Prafulla Dhariwal, Arvind Neelakantan, et al. 2020. “Language
Models Are Few-Shot Learners.” In Nips, 33:1877–1901.
Buades, A., B. Coll, and J.-M. Morel. 2005. “A Non-Local Algorithm
for Image Denoising.” In Cvpr, 2:60–65 vol. 2.
Buolamwini, J., and T. Gebru. 2018. “Intersectional Accuracy
Disparities in Commercial Gender Classification.” In
Proceedings of Machine Learning Research Conference on Fairness,
Accountability, and Transparency., 81:1–15.
Burt, P. J., and E. H. Adelson. 1983. “The Laplacian
Pyramid as a Compact Image Code.” IEEE Transactions on
Communications 31 (4): 532–40.
Burton, Harry Edwin. 1945. “The Optics of Euclid.” J.
Opt. Soc. Am. 35 (5): 357–72.
Butler, D. J., J. Wulff, G. B. Stanley, and M. J. Black. 2012. “A
Naturalistic Open Source Movie for Optical Flow Evaluation.” In
Eccv, edited by A. Fitzgibbon et al., 611–25. Part IV, LNCS
7577. Springer-Verlag.
Canny, J. F. 1986. “A Computational Approach to Edge
Detection.” IEEE Transactions on Pattern Analysis and Machine
Intelligence 8 (6): 679–98.
Caron, Mathilde, Hugo Touvron, Ishan Misra, Hervé Jégou, Julien Mairal,
Piotr Bojanowski, and Armand Joulin. 2021. “Emerging Properties in
Self-Supervised Vision Transformers.” In Iccv, 9650–60.
Cavanagh, P. 1996. “Vision Is Getting Easier Every Day.”
Perception 24: 1227–32.
Cavazos, Jacqueline G., P. Jonathon Phillips, Carlos D. Castillo, and
Alice J. O’Toole. 2021. “Accuracy Comparison Across Face
Recognition Algorithms: Where Are We on Measuring Race Bias?”
IEEE Transactions on Biometrics, Behavior, and Identity Science
3: 101–11.
Chai, Lucy, Jun-Yan Zhu, Eli Shechtman, Phillip Isola, and Richard
Zhang. 2021. “Ensembling with Deep Generative Views.” In
Cvpr, 14997–5007.
Chang, Chin-Kai, Jiaping Zhao, and Laurent Itti. 2018. “DeepVP:
Deep Learning for Vanishing Point Detection on 1 Million Street View
Images.” In 2018 IEEE International Conference on Robotics
and Automation (ICRA), 1–8.
Chang, Jia-Ren, and Yong-Sheng Chen. 2018. “Pyramid Stereo
Matching Network.” In Proceedings of the IEEE/CVF Computer
Vision and Pattern Recognition.
Chechik, Gal, Varun Sharma, Uri Shalit, and Samy Bengio. 2010.
“Large Scale Online Learning of Image Similarity Through
Ranking.” Journal of Machine Learning Research 11 (3).
Chehikian, A., and James Crowley. 1991. “Fast Computation of
Optimal Semi-Octave Pyramids.” Scia, 18–27.
Chen, Mark, Alec Radford, Rewon Child, Jeffrey Wu, Heewoo Jun, David
Luan, and Ilya Sutskever. 2020. “Generative Pretraining from
Pixels.” In Icml, 1691–1703.
Chen, Ting, Simon Kornblith, Mohammad Norouzi, and Geoffrey Hinton.
2020. “A Simple Framework for Contrastive Learning of Visual
Representations.” In Icml, 1597–607.
CIE. 1931. “CIE 1931 XYZ Color Matching Functions.” https://commons.wikimedia.org/wiki/File:CIE1931_RGBCMF.png.
Cohen, Adam L. 1982. “Anti-Pinhole Imaging.” Optica
Acta: Intl. J. Of Optics 29 (1).
Cooley, James W., and John W. Tukey. 1965. “An Algorithm for the
Machine Calculation of Complex Fourier Series.” Mathematics
of Computation 19: 297–301.
Cortes, Corinna, and Vladimir Vapnik. 1995. “Support-Vector
Networks.” Machine Learning 20: 273–97.
Coughlan, James, and Alan L. Yuille. 1999. “Manhattan World:
Compass Direction from a Single Image by Bayesian Inference.” In
Iccv, 941–47.
Criminisi, Antonio. 1999. “Accurate Visual Metrology from Single
and Multiple Uncalibrated Images.” PhD thesis.
Csurka, G., C. Bray, C. Dance, and L. Fan. 2004. “Visual
Categorization with Bags of Keypoints.” Workshop on
Statistical Learning in Computer Vision, ECCV, 1–22.
Cummings, M. L. 2004. “Automation Bias in Intelligent Time
Critical Decision Support Systems.” In AIAA Third Intelligent
Systems Conference.
Curcio, Christine A., Kenneth R. Sloan, Robert E. Kalina, and Anita E.
Hendrickson. 1990. “Human Photoreceptor Topography.”
Journal of Comparative Neurology 292 (4): 497–523.
Curcio, Christine A., Kenneth R. Sloan, Orin S. Packer, Anita
Hendrickson, and Robert E. Kalina. 1987. “Distribution of Cones in
Human and Monkey Retina: Individual Variability and Radial
Asymmetry.” Science 236 4801: 579–82.
Curless, Brian, and Marc Levoy. 1996. “A Volumetric Method for
Building Complex Models from Range Images.” In Proceedings of
the 23rd Annual Conference on Computer Graphics and Interactive
Techniques, 303–12.
Cybenko, G. 1989. “Approximation by Superpositions of a Sigmoidal
Function.” Mathematics of Control, Signals and Systems 2
(4): 303–14.
d’Alessandro, Brian, Cathy O’Neil, and Tom LaGatta. 2017.
“Conscientious Classification: A Data Scientist’s Guide to
Discrimination-Aware Classification.”
Dalal, N., and B. Triggs. 2005. “Histograms of Oriented Gradients
for Human Detection.” In Proceedings of the IEEE/CVF Computer
Vision and Pattern Recognition.
Dalmotas, Dainius J., Regina M. Hurley, and Alan German. 1985.
“Air Bag Deployments Involving Restrained Occupants.”
SAE Transactions 104 (6): 1507–12.
Darrell, Trevor, and Eero Simoncelli. 1993. “On the Use of
’Nulling’ Filters to Separate Transparent Motions.” In
Cvpr.
Daugman, J. G. 1989. “Entropy Reduction and Decorrelation in
Visual Coding by Oriented Neural Receptive Fields.” IEEE
Transactions on Biomedical Engineering 36 (1): 107–14.
De Valois, R. L., K. K. De Valois, and Oxford University Press. 1988.
Spatial Vision. Oxford Psychology Series. Oxford University
Press.
DeBonet, J. S., and P. Viola. 1998. “Texture Recognition Using a
Non-Parametric Multi-Scale Statistical Model.” In Proceedings
of the IEEE/CVF Computer Vision and Pattern Recognition.
Deglr6328. 2006. “Blue Sky Spectrum.”
https://en.wikipedia.org/, File:Spectrum_of_blue_sky.png.
Denton, Emily, Ben Hutchinson, Margaret Mitchell, Timnit Gebru, and
Andrew Zaldivar. 2019. “Image Counterfactual Sensitivity Analysis
for Detecting Unintended Bias.” In Computer Vision and
Pattern Recognition Workshop.
DeTone, D., T. Malisiewicz, and A. Rabinovich. 2018. “SuperPoint:
Self-Supervised Interest Point Detection and Description.” In
2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition
Workshops (CVPRW), 337–33712.
DeVries, Terrance, Ishan Misra, Changhan Wang, and Laurens van der
Maaten. 2019. “Does Object Recognition Work for Everyone?”
In Computer Vision and Pattern Recognition Workshop.
Doersch, Carl, Abhinav Gupta, and Alexei A Efros. 2015.
“Unsupervised Visual Representation Learning by Context
Prediction.” In Iccv, 1422–30.
Doersch, Carl, Saurabh Singh, Abhinav Gupta, Josef Sivic, and Alexei A.
Efros. 2012. “What Makes Paris Look Like Paris?” ACM
Transactions on Graphics 31 (4): 101:1–9.
Doherty, Paul. 2023. https://www.exploratorium.edu/snacks/cd-spectroscope.
Donahue, Jeff, Yangqing Jia, Oriol Vinyals, Judy Hoffman, Ning Zhang,
Eric Tzeng, and Trevor Darrell. 2014. “Decaf: A Deep Convolutional
Activation Feature for Generic Visual Recognition.” In
Icml, 647–55. PMLR.
Donahue, Jeff, Philipp Krähenbühl, and Trevor Darrell. 2017.
“Adversarial Feature Learning.” In Iclr.
Dosovitskiy, Alexey, Lucas Beyer, Alexander Kolesnikov, Dirk
Weissenborn, Xiaohua Zhai, Thomas Unterthiner, Mostafa Dehghani, et al.
2021. “An Image Is Worth 16x16 Words: Transformers for Image
Recognition at Scale.” Iclr.
Dosovitskiy, Alexey, Philipp Fischer, Eddy Ilg, Philip Häusser, Caner
Hazirbas, Vladimir Golkov, Patrick van der Smagt, Daniel Cremers, and
Thomas Brox. 2015. “FlowNet: Learning Optical Flow with
Convolutional Networks.” In Iccv, 2758–66.
Duda, Richard O., and Peter E. Hart. 1972. “Use of the Hough
Transformation to Detect Lines and Curves in Pictures.”
Communications of the ACM 15 (1): 11–15.
Dwork, C., and A. Roth. 2014. “The Algorithmic Foundations of
Differential Privacy.” Foundations and Trends in Theoretical
Computer Science 9 (3–4): 211–407.
Dwork, Cynthia, Moritz Hardt, Toniann Pitassi, Omer Reingold, and Rich
Zemel. 2012. “Fairness Through Awareness.” In ITCS ’12:
Proceedings of the Third Innovations in Theoretical Computer Science
Conference, 214–26.
Dwork, Cynthia, Nitin Kohli, and Deirdre Mulligan. 2019.
“Differential Privacy in Practice: Expose Your Epsilons!”
Journal of Privacy and Confidentiality 9 (2).
Efros, A. A., and W. T. Freeman. 2001. “Image Quilting for Texture
Synthesis and Transfer.” In ACM SIGGRAPH: Proceedings of the
Annual Conference on Computer Graphics and Interactive Techniques,
341–46.
Efros, A. A., and T. K. Leung. 1999. “Texture Synthesis by
Non-Parametric Sampling.” In Proceedings of the IEEE/CVF
International Conference on Computer Vision.
Elman, Jeffrey L. 1990. “Finding Structure in Time.”
Cognitive Science 14 (2): 179–211.
Elsayed, Gamaleldin F, Ian Goodfellow, and Jascha Sohl-Dickstein. 2018.
“Adversarial Reprogramming of Neural Networks.” https://arxiv.org/abs/1806.11146.
Everingham, M., L. Van Gool, C. K. I. Williams, J. Winn, and A.
Zisserman. 2010. “The PASCAL Visual Object Classes (VOC)
Challenge.” International Journal of Computer Vision 88:
303–38.
Fara, P. 2015. “Newton Shows the Light: A Commentary on Newton
(1672) ‘a Letter … Containing His New Theory about Light and
Colours…’.” Philosophical Transactions of the Royal
Society.
Farrell, Michael, and Cliff Haynes. 2017. Straw Camera. https://strawcamera.com/.
Fashion, Amazon. 2021. “Fun World Adult Cockroach Costume.”
https://www.amazon.com/Fun-World-Costumes-Cockroach-Costume/dp/B0038ZQYRC.
Faugeras, Olivier. 1993. Three-Dimensional Computer Vision: A
Geometric Viewpoint. Cambridge, MA: MIT Press.
Fellbaum, Christiane, ed. 1998. WordNet: An Electronic Lexical
Database. Cambridge, MA: MIT Press.
Felzenszwalb, Pedro F., Ross B. Girshick, David McAllester, and Deva
Ramanan. 2010. “Object Detection with Discriminatively Trained
Part-Based Models.” Pami 32 (9): 1627–45.
Fergus, Rob, Pietro Perona, and Andrew Zisserman. 2007. “Weakly
Supervised Scale-Invariant Learning of Models for Visual
Recognition.” International Journal of Computer Vision
71 (3): 273–303.
Fergus, R., B. Singh, A. Hertzmann, S. Roweis, and W. T. Freeman. 2006.
“Removing Camera Shake from a Single Image.” ACM
Transactions on Graphics 25 (3): 787--794.
Field, David J. 1987. “Relations Between the Statistics of Natural
Images and the Response Properties of Cortical Cells.”
Josa 4 (12): 2379–94.
Fischler, M. A., and R. C. Bolles. 1981. “Random Sample Consensus:
A Paradigm for Model Fitting with Applications to Image Analysis and
Automated Cartography.” Communications of the ACM 24
(6): 381–95.
Fischler, M. A., and R. A. Elschlager. 1973. “The Representation
and Matching of Pictorial Structures.” IEEE Transactions on
Computers C-22 (1): 67–92.
Fleet, D., and A. Jepson. 1989. “Computation of Normal Velocity
from Local Phase Information.” In Proceedings of the IEEE/CVF
Computer Vision and Pattern Recognition, 379–86.
Fleming, R. W., R. O. Dror, and E. H. Adelson. 2001. “Surface
Reflectance Estimation Under Unknown Natural Illumination.”
Journal of Vision.
Fleuret, Francois, and Donald Geman. 2001. “Coarse-to-Fine Face
Detection.” Ijcv 41 (1–2): 85--107.
Fodor, Jerry A. 1975. The Language of Thought. Vol. 5.
Cambridge, MA: Harvard University Press.
Forsyth, David A., and Jean Ponce. 2012. Computer Vision - a Modern
Approach, Second Edition. Pitman.
Fourier, Jean Baptiste Joseph. 2009. Théorie
Analytique de la Chaleur. Cambridge Library Collection.
Cambridge University Press.
Freedman, David H. 2010. Wrong: Why Experts* Keep Failing Us–and How
to Know When Not to Trust Them. New York: Little, Brown & Co.
Freeman, W. T. 1994. “The Generic Viewpoint Assumption in a
Framework for Visual Perception.” Nature 368 (6471):
542–45.
Freeman, W. T., and E. H. Adelson. 1990. “Steerable Filters for
Early Vision, Image Analysis, and Wavelet Decomposition.” In
Proceedings of the IEEE/CVF International Conference on Computer
Vision, 406–15.
———. 1991. “The Design and Use of Steerable Filters.”
Pami 13 (9): 891–906.
Freeman, W. T., E. H. Adelson, and D. J. Heeger. 1991. “Motion
Without Movement.” In ACM SIGGRAPH: Proceedings of the Annual
Conference on Computer Graphics and Interactive Techniques, 27–30.
Freeman, W. T., E. H. Adelson, and A. P. Pentland. 1990.
“Shape-from-Shading Analysis with Bumplets and Shadelets.”
Investigative Ophthalmology and Visual Science (ARVO), 410.
Freeman, W. T., D. B. Anderson, P. A. Beardsley, C. N. Dodge, M. Roth,
C. D. Weissman, W. S. Yerazunis, et al. 1998. “Computer Vision for
Interactive Computer Graphics.” IEEE Computer Graphics and
Applications 18 (3): 42–53.
Freeman, W. T., and D. H. Brainard. 1995. “Bayesian Decision
Theory, the Maximum Local Mass Estimate, and Color Constancy.” In
Proceedings of the IEEE/CVF International Conference on Computer
Vision, 210–17.
Freeman, William T. 2020. “How to Write Good Papers.” In.
Fridovich-Keil, Sara, Alex Yu, Matthew Tancik, Qinhong Chen, Benjamin
Recht, and Angjoo Kanazawa. 2022. “Plenoxels: Radiance Fields
Without Neural Networks.” In Cvpr, 5501–10.
Fukushima, Kunihiko. 1980. “Neocognitron: A Self-Organizing Neural
Network Model for a Mechanism of Pattern Recognition Unaffected by Shift
in Position.” Biological Cybernetics 36 (4): 193–202.
Gabor, Dennis. 1946. “Theory of Communication.” Journal
of the Institution of Electrical Engineers - Part I: General 94:
58–58.
Gage, Philip. 1994. “A New Algorithm for Data Compression.”
The C Users Journal 12 (2): 23–38.
Gagniuc, P. A. 2017. Markov Chains: From Theory to Implementation
and Experimentation. John Wiley & Sons.
Galileo, G. 2015. Sidereus Nuncius, or the Sidereal Messenger.
Chicago: University of Chicago Press.
Gardenia. n.d. https://www.gardenia.net/plants/plant-family/hepatica-liverleaf.
Garvie, Clare, Alvaro Bedoya, and Jonathan Frankle. 2019. “The
Perpetual Line-up.” https://www.perpetuallineup.org/.
Gebru, Timnit, and Emily Denton. 2020. “CVPR Tutorial on Fairness,
Accountability, Transparency, and Ethics in Computer Vision.”
———. 2021. “CVPR Workshop: Beyond Fairness: Towards a Just,
Equitable, and Accountable Computer Vision.” https://sites.google.com/view/beyond-fairness-cv/.
Geiger, A, P Lenz, C Stiller, and R Urtasun. 2013. “Vision Meets
Robotics: The KITTI Dataset.” The International Journal of
Robotics Research 32 (11): 1231–37.
Geman, Donald, Bruno Jedynak, Programme Robotique, and Projet Syntim.
1994. “Shape Recognition and Twenty Questions.” In
Proceedings Reconnaissance Des Formes Et Intelligence
Articielle, 21–37.
Gibson, James J. 1966. The Senses Considered as Perceptual
Systems. Boston: Houghton Mifflin.
———. 1979. The Ecological Approach to Visual Perception.
Boston: Houghton Mifflin.
Gilbert, Rob. n.d. “Quotes.” https://www.quotes.net/quote/13310.
Gilchrist, Alan. 2006. Seeing Black and White. Oxford
University Press.
Gilmer, Justin, Samuel S. Schoenholz, Patrick F. Riley, Oriol Vinyals,
and George E. Dahl. 2017. “Neural Message Passing for Quantum
Chemistry.” In International Conference on Machine
Learning, 1263–72.
Gkioxari, G., J. Johnson, and J. Malik. 2019. “Mesh r-CNN.”
In Iccv, 9784–94.
Glickstein, Mitch. 2006. “Golgi and Cajal: The Neuron Doctrine and
the 100th Anniversary of the 1906 Nobel Prize.” Current
Biology 16 (5): R147–51.
Goodfellow, Ian, Yoshua Bengio, and Aaron Courville. 2016. Deep
Learning. Cambridge, MA: MIT Press.
Goodfellow, Ian, Jean Pouget-Abadie, Mehdi Mirza, Bing Xu, David
Warde-Farley, Sherjil Ozair, Aaron Courville, and Yoshua Bengio. 2014.
“Generative Adversarial Nets.” In Nips. Vol. 27.
Gorkani, M. M., and R. W. Picard. 1994. “Texture Orientation for
Sorting Photos ’at a Glance.’.” In Proceedings of 12th
International Conference on Pattern Recognition, 1:459–464 vol.1.
Gortler, Steven J., Radek Grzeszczuk, Richard Szeliski, and Michael F.
Cohen. 1996. “The Lumigraph.” In Proceedings of the
23rd Annual Conference on Computer Graphics and Interactive
Techniques, 43–54.
Gou, Jianping, Baosheng Yu, Stephen J Maybank, and Dacheng Tao. 2021.
“Knowledge Distillation: A Survey.” Ijcv 129:
1789–819.
Granlund, G. H. 1978. “In Search of a General Picture Processing
Operator.” Computer Graphics, Image Proc. 8: 155–73.
Granlund, G., and H. Knutsson. 1995. Signal Processing for Computer
Vision. New York, NY: Springer.
Granlund, Goesta H. 1978. “In Search of a General Picture
Processing Operator.” Computer Graphics and Image
Processing 8 (2): 155–73.
Griffin, Gregory, Alex Holub, and Pietro Perona. 2007.
“Caltech-256 Object Category Dataset.”
Grother, Patrick, Mei Ngan, and Kayee Hanaoka. 2019. “Face
Recognition Vendor Test (FRVT). Part 3: Demographic Effects.”
NISTIR 8280.
Grünwald, Peter D. 2007. The Minimum Description Length
Principle. Cambridge, MA: MIT Press.
Gupta, Tanmay, and Aniruddha Kembhavi. 2023. “Visual Programming:
Compositional Visual Reasoning Without Training.” In
Cvpr, 14953–62.
Ha, David, Andrew Dai, and Quoc V Le. 2016.
“Hypernetworks.” Iclr.
Hadsell, Raia, Sumit Chopra, and Yann LeCun. 2006. “Dimensionality
Reduction by Learning an Invariant Mapping.” In Cvpr,
2:1735–42.
Hamidi, Foad, Morgan Klaus Scheuerman, and Stacy M. Branham. 2018.
“Gender Recognition or Gender Reductionism?: The Social
Implications of Embedded Gender Recognition Systems.” In CHI
’18: Proceedings of the 2018 CHI Conference on Human Factors in
Computing Systems.
Hammersley, J. M., and P. Clifford. 1971. “Markov Fields on Finite
Graphs and Lattices.” http://www.statslab.cam.ac.uk/~grg/books/hammfest/hamm-cliff.pdf.
Hancock, Peter, Roland Baddeley, and Leslie Smith. 1970. “The
Principal Components of Natural Images.” Network: Computation
in Neural Systems 3.
Hardt, Moritz. 2020. “MLSS 2020, Tübingen.”
Harmon, Leon D., and Bela Julesz. 1973. “Masking in Visual
Recognition: Effects of Two-Dimensional Filtered Noise.”
Science 180 (4091): 1194–97.
Harris, Chris, and Mike Stephens. 1988. “A Combined Corner and
Edge Detector.” In Proc. Of Fourth Alvey Vision
Conference, 147–51.
Hartley, R., and A. Zisserman. 2004. Multiple View Geometry in
Computer Vision. 2nd ed. Cambridge, UK: Cambridge University Press.
Hartline, H. K. 1938. “The Response of Single Optic Nerve Fibers
of the Vertebrate Eye to Illumination of the Retina.”
American Journal of Physiology-Legacy Content 121 (2): 400–415.
Hassenstein, V., and W. Reichardt. 1956. “System Theoretical
Analysis of Time, Sequence and Sign Analysis of the Motion Perception of
the Snout-Beetle Chlorophanus.” Z.
Naturforsch. B 11: 513–24.
Hays, James, and Alexei A Efros. 2007. “Scene Completion Using
Millions of Photographs.” Tog 26 (3): 4.
He, Kaiming, Xinlei Chen, Saining Xie, Yanghao Li, Piotr Dollár, and
Ross Girshick. 2022. “Masked Autoencoders Are Scalable Vision
Learners.” In Cvpr, 15979–88.
He, Kaiming, Haoqi Fan, Yuxin Wu, Saining Xie, and Ross Girshick. 2020.
“Momentum Contrast for Unsupervised Visual Representation
Learning.” In Cvpr, 9729–38.
He, Kaiming, Georgia Gkioxari, Piotr Dollár, and Ross Girshick. 2017.
“Mask r-CNN.” In Iccv, 2961–69.
He, Kaiming, Jian Sun, and Xiaoou Tang. 2009. “Single Image Haze
Removal Using Dark Channel Prior.” In Cvpr, 1956–63.
He, Kaiming, Xiangyu Zhang, Shaoqing Ren, and Jian Sun. 2016.
“Deep Residual Learning for Image Recognition.” In
Cvpr, 770–78.
He, Ruifei, Shuyang Sun, Xin Yu, Chuhui Xue, Wenqing Zhang, Philip Torr,
Song Bai, and Xiaojuan Qi. 2022. “Is Synthetic Data from
Generative Models Ready for Image Recognition?” https://arxiv.org/abs/2210.07574.
Hebb, Donald Olding. 2005. The Organization of Behavior: A
Neuropsychological Theory. Psychology Press.
Hecht, Eugene. 2016. Optics. 5th ed. Hoboken, NJ: Pearson.
Heeger, D. 1995.
Heeger, D. J., and E. P. Simoncelli. 1992. “Model of Visual Motion
Sensing.” In Spatial Vision in Humans and Robots, edited
by L. Harris and M. Jenkin. Cambridge Univ. Press.
Heeger, David J., and James R. Bergen. 1995. “Pyramid-Based
Texture Analysis/Synthesis.” In Computer Graphics
Proceedings, 229–38.
Helmholtz, H. Von. 1962. Treatise on Physiological Optics. Vol.
III. New York: Dover.
Helmholtz, Hermann von. 1925. Helmholtz’s Treatise on Physiological
Optics. Optical Society of America.
Hinton, Geoffrey E. 2002. “Training Products of Experts by
Minimizing Contrastive Divergence.” Neural Computation
14 (8): 1771–1800.
Hinton, Geoffrey, Oriol Vinyals, and Jeff Dean. 2015. “Distilling
the Knowledge in a Neural Network.” https://arxiv.org/abs/1503.02531.
Hirschmüller, H. 2007. “Stereo Processing by Semi-Global Matching
and Mutual Information.” 30 (2): 328–41.
Ho, Jonathan, Ajay Jain, and Pieter Abbeel. 2020. “Denoising
Diffusion Probabilistic Models.” In Nips, 33:6840–51.
Hochreiter, Sepp, and Jürgen Schmidhuber. 1997. “Long Short-Term
Memory.” Neural Computation 9 (8): 1735–80.
Hofer, H., J. Carroll, J. Neitz, M. Neitz, and D. R. Williams. 2005.
“Organization of the Human Trichromatic Cone Mosaic.”
The Journal of Neuroscience 25 (42): 9669–79.
Hoffman, Judy, Eric Tzeng, Taesung Park, Jun-Yan Zhu, Phillip Isola,
Kate Saenko, Alexei Efros, and Trevor Darrell. 2018. “Cycada:
Cycle-Consistent Adversarial Domain Adaptation.” In
Icml, 1989–98. PMLR.
Hoiem, Derek, Alexei A. Efros, and Martial Hebert. 2005.
“Automatic Photo Pop-up.” Tog 24 (3): 577–84.
———. 2008. “Putting Objects in Perspective.” Ijcv
80 (1): 3–15.
Hoogterp, W. 2014. Your Perfect Presentation. McGraw-Hill
Education eBooks.
Hopfield, John J. 1982. “Neural Networks and Physical Systems with
Emergent Collective Computational Abilities.” Proceedings of
the National Academy of Sciences 79 (8): 2554–58.
Horn, B. K. P. 1986. Robot Vision. Cambridge, MA: MIT Press.
Horn, B. K. P., and M. J. Brooks, eds. 1989. Shape from
Shading. Cambridge, MA: MIT Press.
Horn, B. K. P., and B. G. Schunck. 1981. “Determining Optical
Flow.” Artificial Intelligence 17: 185–203.
Horn, Berthold K. P. 1977. “Understanding Image
Intensities.” Artificial Intelligence 8 (2): 201–31.
Hosni, Asmaa, Christoph Rhemann, Michael Bleyer, Carsten Rother, and
Margrit Gelautz. 2013. “Fast Cost-Volume Filtering for Visual
Correspondence and Beyond.” IEEE Transactions on Pattern
Analysis and Machine Intelligence 35 (2): 504–11.
Hospital, San Jose Animal. 2021. http://www.sanjoseanimalhospital.com/puppy-and-kitten-packages.
Hough, Paul V. C. 1959. “Machine Analysis of Bubble Chamber
Pictures.” In International Conference on High Energy
Accelerators and Instrumentation, CERN, 1959, 554–56.
Houthooft, Rein, Richard Y. Chen, Phillip Isola, Bradly C. Stadie, Filip
Wolski, Jonathan Ho, and Pieter Abbeel. 2018. “Evolved Policy
Gradients.” In Nips.
Hu, Edward J, Yelong Shen, Phillip Wallis, Zeyuan Allen-Zhu, Yuanzhi Li,
Shean Wang, Lu Wang, and Weizhu Chen. 2022. “LoRa: Low-Rank
Adaptation of Large Language Models.” In Iclr.
Huang, Yanping, and Rajesh P. N. Rao. 2011. “Predictive
Coding.” Wiley Interdisciplinary Reviews: Cognitive
Science 2 (5): 580–93.
Hubel, D. H., and T. N. Wiesel. 1959. “Receptive Fields of Single
Neurones in the Cat’s Striate Cortex.” The Journal of
Physiology 148 (3): 574–91. https://doi.org/10.1113/jphysiol.1959.sp006308.
———. 1962. “Receptive Fields, Binocular Interaction and Functional
Architecture in the Cat’s Visual Cortex.” J. Physiol.
160: 106–54.
Hutchinson, Ben, and Margaret Mitchell. 2019. “50 Years of Test
(Un)fairness: Lessons for Machine Learning.” In FAT* ’19:
Proceedings of the Conference on Fairness, Accountability, and
Transparency, 49–58.
Huxley, Aldus. 1932. Brave New World. Chatto; Windus.
Ioffe, Sergey, and Christian Szegedy. 2015. “Batch Normalization:
Accelerating Deep Network Training by Reducing Internal Covariate
Shift.” In Icml, 448–56.
Isola, Phillip, Joseph J. Lim, and Edward H. Adelson. 2015.
“Discovering States and Transformations in Image
Collections.” In Cvpr.
Isola, Phillip, Jun-Yan Zhu, Tinghui Zhou, and Alexei A. Efros. 2017.
“Image-to-Image Translation with Conditional Adversarial
Networks.” In Cvpr.
Jaderberg, Max, Karen Simonyan, Andrew Zisserman, and koray kavukcuoglu.
2015. “Spatial Transformer Networks.” In Nips,
2017–25.
Jahanian, Ali, Xavier Puig, Yonglong Tian, and Phillip Isola. 2022.
“Generative Models as a Data Source for Multiview Representation
Learning.” In Iclr.
Jefferys, William H., and James O. Berger. 1992. “Ockham’s Razor
and Bayesian Analysis.” American Scientist 80 (1):
64–72.
Jia, Menglin, Luming Tang, Bor-Chun Chen, Claire Cardie, Serge Belongie,
Bharath Hariharan, and Ser-Nam Lim. 2022. “Visual Prompt
Tuning.” In Eccv, 709–27.
Johansson, Gunnar. 1973. “Visual Perception of Biological Motion
and a Model for Its Analysis.” Perception &
Psychophysics 14 (2): 201–11.
Jordan, M. I., ed. 1998. Learning in Graphical Models.
Cambridge MA: MIT Press.
Julesz, Bela. 1981. “Textons, the Elements of Texture Perception,
and Their Interactions.” Nature 290 (5802): 91–97.
Julez, B. 1971. Foundations of Cyclopean Perception. University
of Chicago Press.
Kac, Mark. 1966. “Can One Hear the Shape of a Drum?”
American Mathematical Monthly.
Kahn, Jeremy. 2021. “HireVue Drops Facial Monitoring Amid AI
Algorithm Audit.” Fortune.
Kahneman, Daniel. 2011. Thinking, Fast and Slow. Macmillan.
Kajiya, James T. 1986. “The Rendering Equation.” In
Siggraph, 143--150.
Kajiya, Jim. 1993. “How to Get Your SIGGRAPH Paper
Rejected.” In SIGGRAPH Papers Chair.
Kalderon, Mark Eli. 2015. Form Without Matter: Empedocles and
Aristotle on Color Perception. Oxford, UK: Oxford University Press.
Kandel, Eric R., James H. Schwartz, and Thomas M. Jessell, eds. 1991.
Principles of Neural Science. Third. New York: Elsevier.
Kanizsa, Gaetano. 1979. Organization in Vision: Essays
on Gestalt Perception. New York: Praeger Publishers.
Kanwisher, Nancy G., Josh McDermott, and Marvin M. Chun. 1997.
“The Fusiform Face Area: A Module in Human Extrastriate Cortex
Specialized for Face Perception.” The Journal of
Neuroscience 17: 4302–11.
Karnieli, Asaf, Ohad Fried, and Yacov Hel-Or. 2022. “DeepShadow:
Neural Shape from Shadow.” In Eccv, 415–30.
Karras, Tero, Miika Aittala, Samuli Laine, Erik Härkönen, Janne
Hellsten, Jaakko Lehtinen, and Timo Aila. 2021. “Alias-Free
Generative Adversarial Networks.” In Nips.
Kaufman, E L, and M W Lord. 1949. “The Discrimination of Visual
Number.” The American Journal of Psychology 62 (4):
498–525.
Kearns, M., and A. Roth. 2020. The Ethical Algorithm: The Science of
Socially Aware Algorithm Design. Oxford University Press.
Kendall, A., H. Martirosyan, S. Dasgupta, P. Henry, R. Kennedy, A.
Bachrach, and A. Bry. 2017. “End-to-End Learning of Geometry and
Context for Deep Stereo Regression.” In Proceedings of the
IEEE/CVF International Conference on Computer Vision.
Kerbl, Bernhard, Georgios Kopanas, Thomas Leimkühler, and George
Drettakis. 2023. “3D Gaussian Splatting for Real-Time Radiance
Field Rendering.” Tog 42 (4): 1–14.
Kim, Taeksoo, Moonsu Cha, Hyunsoo Kim, Jung Kwon Lee, and Jiwon Kim.
2017. “Learning to Discover Cross-Domain Relations with Generative
Adversarial Networks.” In Icml, 1857–65.
Kingdom, Frederick A. A., Ali Yoonessi, and Elena Gheorghiu. 2017.
“The Leaning Tower Illusion.” In The Oxford Compendium of Visual Illusions.
Oxford University Press.
Kingma, Diederik P., and Jimmy Ba. 2014. “Adam: A Method for
Stochastic Optimization.” https://arxiv.org/abs/1412.6980.
Kingma, Diederik P, and Max Welling. 2014. “Auto-Encoding
Variational Bayes.” Iclr.
Kingma, Diederik P, Max Welling, et al. 2019. “An Introduction to
Variational Autoencoders.” Foundations and Trends in Machine
Learning 12 (4): 307–92.
Kingma, Diederik, Tim Salimans, Ben Poole, and Jonathan Ho. 2021.
“Variational Diffusion Models.” In Nips,
34:21696–707.
Kipf, T., and M. Welling. 2017. “Semi-Supervised Classification
with Graph Convolutional Networks.” https://arxiv.org/abs/1609.02907.
Kirillov, Alexander, Eric Mintun, Nikhila Ravi, Hanzi Mao, Chloe
Rolland, Laura Gustafson, Tete Xiao, et al. 2023. “Segment
Anything.” In Iccv, 4015–26.
Klare, B. F., M. J. Burge, J. C. Klontz, R. W. Vorder Bruegge, and A. K.
Jain. 2012. “Face Recognition Performance: Role of Demographic
Information.” IEEE Trans. On Information Forensics and
Security 7 (6): 1789–1801.
Knill, David C., Pascal Mamassian, and Daniel Kersten. 1997.
“Geometry of Shadows.” Josa 14 (12): 3216–32.
Knuth, D. E., T. L. Larrabee, and P. M. Roberts. 1989. Mathematical
Writing. Mathematical Association of America Notes.
Koch, C, and S Ullman. 1985. “Shifts in Selective Visual
Attention: Towards the Underlying Neural Circuitry.” Human
Neurobiology 4 (4): 219–27.
Koenderink, J. J. 1988. “Image Structure.” In
Mathematics and Computer Science in Medical Imaging, edited by
M. A. Viergever and A. E. Todd-Pokropek, 67–103. Berlin:
Springer-Verlag.
———. 1990. Solid Shape. Cambridge, MA: MIT Press.
Koenderink, J. J., and A. J. van Doorn. 1987. “Representation of
Local Geometry in the Visual System.” Biological
Cybernetics 55: 367–75.
Koffka, Kurt. 1935. Principles of Gestalt Psychology. London:
Routledge.
Koh, Pang Wei, Thao Nguyen, Yew Siang Tang, Stephen Mussmann, Emma
Pierson, Been Kim, and Percy Liang. 2020. “Concept Bottleneck
Models.” In Icml, 5338–48.
Koller, D., and N. Friedman, eds. 2009. Probabilistic Graphical
Models. Cambridge MA: MIT Press.
Kolmogorov, V. 2006. “Convergent Tree-Reweighted Message Passing
for Energy Minimization.” IEEE Transactions on Pattern
Analysis and Machine Intelligence 28 (10).
Komodakis, Nikos, and Spyros Gidaris. 2018. “Unsupervised
Representation Learning by Predicting Image Rotations.” In
Iclr.
Krishna, Ranjay, Yuke Zhu, Oliver Groth, Justin Johnson, Kenji Hata,
Joshua Kravitz, Stephanie Chen, et al. 2017. “Visual Genome:
Connecting Language and Vision Using Crowdsourced Dense Image
Annotations.” Ijcv 123 (1): 32--73.
Krizhevsky, Alex. 2009. “Learning Multiple Layers of Features from
Tiny Images.” University of Toronto, Toronto.
Krizhevsky, Alex, Ilya Sutskever, and Geoffrey E Hinton. 2012.
“Imagenet Classification with Deep Convolutional Neural
Networks.” In Nips, 25:1097–1105.
Kuffler, Stephen W. 1953. “Discharge Patterns and Functional
Organization of Mammalian Retina.” Journal of
Neurophysiology 16 (1): 37–68.
Lamott, Anne. 1980. Bird by Bird. Bantam Doubleday Dell
Publishing Group.
Land, E. H. 1983. “Recent Advances in Retinex Theory and Some
Implications for Cortical Computations: Color Vision and the Natural
Image.” Proceedings of the National Academy of Sciences of
the USA 80: 5163–69.
Land, Edwin Herbert, and John J. McCann. 1971. “Lightness and
Retinex Theory.” Journal of the Optical Society of
America 61 1: 1–11.
Larsson, Gustav, Michael Maire, and Gregory Shakhnarovich. 2016.
“Learning Representations for Automatic Colorization.” In
Eccv, 577–93. Springer.
LeCun, Y. 2006. “A Tutorial on Energy-Based Learning.” http://www.cs.toronto.edu/~vnair/ciar/lecun1.pdf.
———. 2007. “Energy-Based Models: The Cure Against Bayesian
Fundamentalism.” https://www.mit.edu/~9.520/spring07/Classes/lecun-20070502-mit.pdf.
LeCun, Yann, Bernhard Boser, John S Denker, Donnie Henderson, Richard E
Howard, Wayne Hubbard, and Lawrence D Jackel. 1989.
“Backpropagation Applied to Handwritten Zip Code
Recognition.” Neural Computation 1 (4): 541–51.
Lepetit, V., and P. Fua. 2006. “Keypoint Recognition Using
Randomized Trees.” Pami 28 (9): 1465–79.
Leung, T. K., M. C. Burl, and P. Perona. 1995. “Finding Faces in
Cluttered Scenes Using Random Labeled Graph Matching.” In
Iccv, 637–44.
Levin, A., Y. Weiss, F. Durand, and W. T. Freeman. 2011.
“Efficient Marginal Likelihood Optimization in Blind
Deconvolution.” In Proceedings of the IEEE/CVF Computer
Vision and Pattern Recognition.
Levoy, Marc. 2010. “Optics i: Lenses and Apertures.” https://graphics.stanford.edu/courses/cs178-13/lectures/optics1-09apr13.pdf.
Levoy, Marc, and Pat Hanrahan. 1996. “Light Field
Rendering.” In Siggraph, 31--42.
Li, Fei-Fei, Marco Andreeto, Marc’Aurelio Ranzato, and Pietro Perona.
2022. “Caltech 101.” CaltechDATA.
Li, Junnan, Dongxu Li, Silvio Savarese, and Steven Hoi. 2023.
“BLIP-2: Bootstrapping Language-Image Pre-Training
with Frozen Image Encoders and Large Language Models.” In
Icml.
Li, Zhengqi, and Noah Snavely. 2018. “MegaDepth: Learning
Single-View Depth Prediction from Internet Photos.” In
Cvpr.
Lin, T., P. Dollar, R. Girshick, K. He, B. Hariharan, and S. Belongie.
2017. “Feature Pyramid Networks for Object Detection.” In
Cvpr, 936–44.
Lin, Tsung-Yi, Michael Maire, Serge Belongie, Lubomir Bourdev, Ross
Girshick, James Hays, Pietro Perona, Deva Ramanan, C. Lawrence Zitnick,
and Piotr Dollár. 2014. “Microsoft COCO: Common Objects in
Context.” In Eccv, 740–55.
Lindeberg, Tony. 1994. Scale-Space Theory in Computer Vision.
Kluwer Academic Publishers.
Linsker, Ralph. 1988. “Self-Organization in a Perceptual
Network.” Computer 21 (3): 105–17.
Lipson, L., Z. Teed, and J. Deng. 2021. “RAFT-Stereo: Multilevel
Recurrent Field Transforms for Stereo Matching.” In
International Conference on 3D Vision (3DV).
Liu, Ce, William T. Freeman, Edward H. Adelson, and Yair Weiss. 2008.
“Human-Assisted Motion Annotation.” In Cvpr, 1–8.
Liu, Rosanne, Joel Lehman, Piero Molino, Felipe Petroski Such, Eric
Frank, Alex Sergeev, and Jason Yosinski. 2018. “An Intriguing
Failing of Convolutional Neural Networks and the Coordconv
Solution.” https://arxiv.org/abs/1807.03247.
Livingstone, M S, and David H. Hubel. 1984. “Anatomy and
Physiology of a Color System in the Primate Visual Cortex.” In
The Journal of Neuroscience.
Long, Jonathan, Evan Shelhamer, and Trevor Darrell. 2015. “Fully
Convolutional Networks for Semantic Segmentation.” In
Cvpr, 3431–40.
Longuet-Higgens, H. C. 1981. “A Computer Algorithm for
Reconstructing a Scene from Two Projections.” Nature
293: 133–35.
Loop, Charles, and Zhengyou Zhang. 1999. “Computing Rectifying
Homographies for Stereo Vision.” In Cvpr, 1:125–31.
Lowe, D. G. 2004. “Distinctive Image Features from Scale-Invariant
Keypoints.” International Journal of Computer Vision 60
(2): 91–110.
Lowe, David G. 1985. Perceptual Organization and Visual
Recognition. Kluwer Academic Publishers.
Lucas, B. D., and T. Kanade. 1981. “An Iterative Image
Registration Technique with an Application to Stereo Vision.” In
Proceedings of Imaging Understanding Workshop, 121–30.
Maaten, Laurens van der, and Geoffrey Hinton. 2008. “Visualizing
Data Using t-SNE.” Journal of Machine Learning Research
9 (Nov): 2579–2605.
MacKay, David J. C. 2003. Information Theory, Inference and Learning
Algorithms. Cambridge University Press.
Madison, Cindee, William Thompson, Daniel Kersten, Peter Shirley, and
Brian Smits. 2001. “Use of Interreflection and Shadow for Surface
Contact.” Perception & Psychophysics 63 (2): 187–94.
Malik, J., and P. Perona. 1990. “Preattentive Texture
Discrimination with Early Vision Mechanisms.” Journal of the
Optical Society of America A 7: 923–31.
Malisiewicz, Tomasz, and Alexei A. Efros. 2009. “Beyond
Categories: The Visual Memex Model for Reasoning about Object
Relationships.” In Nips.
Marr, D. 1982. Vision: A Computational Investigation into the Human
Representation and Processing of Visual Information. W. H. Freeman,
San Francisco.
———. 2010. Vision. Cambridge, MA: MIT Press.
Marr, D. C., and E. Hildreth. 1980. “Theory of Edge
Detection.” Proceedings of the Royal Society B 207:
187–217.
Martens, Rhonda. 2001. “Optics: Paralipomena to Witelo, and
Optical Part of Astronomy. Johannes Kepler. Translated by William h.
Donahue.” Isis 92 (3): 607–8.
Matas, Jiri, Ondrej Chum, Martin Urban, and Tomas Pajdla. 2004.
“Robust Wide Baseline Stereo from Maximally Stable Extremal
Regions.” Image and Vision Computing 22 (September):
761–67.
Matheron, G. 1975. Random Sets and Integral Geometry. Wiley.
Matusik, Wojciech, Hanspeter Pfister, Matt Brand, and Leonard McMillan.
2002. “Directional Reflectance and Emissivity of an Opaque
Surface.” ACM Transactions on Graphics 22.
Max, Nelson. 1995. “Optical Models for Direct Volume
Rendering.” IEEE Transactions on Visualization and Computer
Graphics 1 (2): 99–108.
Max, Nelson, and Min Chen. 2005. “Local and Global Illumination in
the Volume Rendering Integral.” Lawrence Livermore National
Lab.(LLNL), Livermore, CA (United States).
Mayer, N., E. Ilg, P. Hausser, P. Fischer, D. Cremers abd A.
Dosovitskiy, and T. Brox. 2016. “A Large Dataset to Train
Convolutional Networks for Disparity, Optical Flow, and Scene Flow
Estimation.” In Proceedings of the IEEE/CVF Computer Vision
and Pattern Recognition.
McCann, J. J., S. P. McKee, and T. H. Taylor. 1976. “Quantitative
Studies in Retinex Theory: A Comparison Between Theoretical Predictions
and Observer Responses to the ’Color Mondrian’ Experiments.”
Vision Research 16: 445–58.
McManus, Jim. 2022. “Real Titanic 3D.” http://www.realtitanic3d.com/.
Mead, Carver. 1989. Analog VLSI and Neural Systems. Boston:
Addison-Wesley Longman Publishing.
Mermin, N. David. 1989. “What’s Wrong with These
Equations?” Physics Today.
Mersereau, R. M. 1979. “The Processing of Hexagonally Sampled
Two-Dimensional Signals.” Proceedings of the IEEE 67
(6): 930–49.
Mikolajczyk, Krystian, and Cordelia Schmid. 2002. “An Affine
Invariant Interest Point Detector.” In Eccv, edited by
Anders Heyden, Gunnar Sparr, Mads Nielsen, and Peter Johansen, 128–42.
Mikolajczyk, K., and C. Schmid. 2001. “Indexing Based on Scale
Invariant Interest Points.” In Iccv, 2:525.
———. 2005. “A Performance Evaluation of Local Descriptors.”
Pami 27 (10): 1615–30.
Mildenhall, Ben, Pratul P Srinivasan, Matthew Tancik, Jonathan T Barron,
Ravi Ramamoorthi, and Ren Ng. 2020. “Nerf: Representing Scenes as
Neural Radiance Fields for View Synthesis.” In Eccv,
405–21. Springer.
Minnaert, Marcel. 1993. Light and Color in the Outdoors. New
York: Springer New York.
———. 2012. Light and Color in the Outdoors. New York: Springer.
Minsky, Marvin, and Seymour Papert. 1969. Perceptrons.
Cambridge, MA: MIT Press.
Moghaddam, B., and A. P. Pentland. 1997. “Probabilistic Visual
Learning for Object Representation.” IEEE Transactions on
Pattern Analysis and Machine Intelligence 19 (7): 696–710.
Mozur, P. 2019. “One Month, 500,000 Face Scans.” New
York Times.
Mullainathan, Sendhil. 2019. “Biased Algorithms Are Easier to Fix
Than Biased People.” New York Times.
Mundy, Joseph. 2006. “Object Recognition in the Geometric Era: A
Retrospective.” In Toward Category Level Object
Recognition, 4170:3–28.
Murakami, I., A. Kitaoka, and H. Ashida. 2010. “Artificial Image
Oscillation Enhances the Rotating Snakes Illusion.” Journal
of Vision 6 (6): 551.
Murphy, Christopher J., and Howard C. Howland. 1986. “On the Gekko
Pupil and Scheiner’s Disc.” Vision Research 26 (5):
815–17.
Murphy, Kevin P. 2022. Probabilistic Machine Learning: An
Introduction. Cambridge, MA: MIT Press.
Murphy, Kevin, Yair Weiss, and Michael I. Jordan. 1999. “Loopy
Belief Propagation for Approximate Inference: An Empirical
Study.” In Proceedings of the Fifteenth Conference on
Uncertainty in Artificial Intelligence, 467–75.
Nathan Silberman, Pushmeet Kohli, Derek Hoiem, and Rob Fergus. 2012.
“Indoor Segmentation and Support Inference from RGBD
Images.” In Eccv.
Navez, B. n.d. https://commons.wikimedia.org/w/index.php?curid=855487.
Nayar, Shree K., Katsushi Ikeuchi, and Takeo Kanade. 1991. “Shape
from Interreflections.” Ijcv 6 (3): 173–95.
Necker, L. A. 2005. “Observations on Some Remarkable Optical
Phaenomena Seen in Switzerland; and on an Optical Phaenomenon Which
Occurs on Viewing a Figure of a Crystal or Geometrical Solid.”
London and Edinburgh Philosophical Magazine and Journal of
Science 5 (1): 329–37.
Nene, Samer A., Shree K. Nayar, and Hiroshi Murase. 1996.
“Columbia Object Image Library (COIL-20).” Department of
Computer Science, Columbia University.
Ng, Andrew Y., Michael I. Jordan, and Yair Weiss. 2001. “On
Spectral Clustering: Analysis and an Algorithm.” In
Nips, 849–56.
Nicodemus, Fred E. 1965. “Directional Reflectance and Emissivity
of an Opaque Surface.” Applied Optics 4: 767–75.
Noble, Safiya Umoja. 2018. Algorithms of Oppression. NYU Press,
Inc.
Olah, Chris, Alexander Mordvintsev, and Ludwig Schubert. 2017.
“Feature Visualization.” Distill 2 (11): e7.
Oliva, A., and A. Torralba. 2001. “Modeling the Shape of the
Scene: A Holistic Representation of the Spatial Envelope.”
International Journal of Computer Vision 42(3): 145–75.
Oord, Aaron van den, Yazhe Li, and Oriol Vinyals. 2018.
“Representation Learning with Contrastive Predictive
Coding.” https://arxiv.org/abs/1807.03748.
OpenAI. 2023. “GPT-4 Technical Report.” https://arxiv.org/abs/2303.08774.
———. 2024. “GPT-4V(ision) System Card.”
Oppenheim, A. V., and J. S. Lim. 1981. “The Importance of Phase in
Signals.” Proceedings of the IEEE 69 (5): 529–41.
Oppenheim, Alan V., Alan S. Willsky, and S. Hamid Nawab. 1996.
Signals and Systems, 2nd Ed. Hoboken, NJ: Prentice-Hall.
Ordonez, Vicente, Girish Kulkarni, and Tamara Berg. 2011.
“Im2text: Describing Images Using 1 Million Captioned
Photographs.” In Nips. Vol. 24.
Oren, M., and S. K. Nayar. 1994. “Generalization of Lambert’s
Reflection Model.” In ACM SIGGRAPH: Proceedings of the Annual
Conference on Computer Graphics and Interactive Techniques, 239–46.
Orwell, George. 1948. 1984. Prabhat Prakashan.
Owen, Art B. 2013. Monte Carlo Theory, Methods and Examples. https://artowen.su.domains/mc/.
Palmer, Irvin. 1994. “Rethinking Perceptual Organization: The Role
of Uniform Connectedness.” Psychonomic Bulletin &
Review. 1 (1).
Palmer, S., E. Rosch, and P. Chase. 1981. “Canonical Perspective
and the Perception of Objects.” International Symposium on
Attention and Performance (Attention and Performance IX)., 135–51.
Palmer, Stephen E. 1999. Vision Science: Photons to
Phenomenology. Cambridge, MA: MIT Press.
Pantone. 2020. “Munsell USDA Frozen French Fry Standard.”
https://www.pantone.com/products/munsell/munsell-usda-frozen-french-fry-standard.
Papert, Seymour. 1966. “The Summer Vision Project.” MIT AI
Memo 100. Massachusetts Institute of Technology, Project Mac.
Parikh, Devi, and Dhruv Batra. 2018. “CVPR18 Workshop Panel: How
to Be a Good Citizen of the CVPR Community.”
Paris, Sylvain, Pierre Kornprobst, Jack Tumblin, and Frédo Durand. 2009.
Bilateral Filtering: Theory and Applications. Now Publishers.
Parish, Yoav I. H., and Pascal Müller. 2001. “Procedural Modeling
of Cities.” In Proceedings of the 28th Annual Conference on
Computer Graphics and Interactive Techniques, 301–8.
Pathak, Deepak, Pulkit Agrawal, Alexei A Efros, and Trevor Darrell.
2017. “Curiosity-Driven Exploration by Self-Supervised
Prediction.” In Icml, 2778–87.
Pathak, Deepak, Philipp Krahenbuhl, Jeff Donahue, Trevor Darrell, and
Alexei A Efros. 2016. “Context Encoders: Feature Learning by
Inpainting.” In Cvpr, 2536–44.
Pell, Denis G, Patrick Cavanagh, Robert Desimone, Bosco Tjan, and Anne
Treisman. 2007. “Crowding: Including Illusory Conjunctions,
Surround Suppression, and Attention.” Journal of Vision
7 (2): 1.
Pentland, A. P. 1990. “Linear Shape from Shading.”
International Journal of Computer Vision 1 (4): 153–62.
Perona, P. 1995. “Deformable Kernels for Early Vision.”
IEEE Transactions on Pattern Analysis and Machine Intelligence
17 (5): 488–99.
Perona, P., and J. Malik. 1990a. “Detecting and Localizing Edges
Composed of Steps, Peaks and Roofs.” In Proceedings of the
IEEE/CVF International Conference on Computer Vision.
———. 1990b. “Scale-Space and Edge Detection Using Anisotropic
Diffusion.” IEEE Transactions on Pattern Analysis and Machine
Intelligence 12 (7): 629–39.
Phong, Bui Tuong. 1975. “Illumination for Computer Generated
Pictures.” Commun. ACM 18 (6): 311–17.
Photos, MN. n.d. http://www.flickr.com/photos/mnsomero/2738807250/.
Plato. 360 BCE. Translated by Benjamin Jowett. https://classics.mit.edu/Plato/timaeus.html.
Poggio, T., V. Torre, and C. Koch. 1985. “Computational Vision and
Regularization Theory.” Nature 317 (26): 314–139.
Pollefeys, M., R. Koch, and L. Van Gool. 1999. “A Simple and
Efficient Rectification Method for General Motion.” In
Proceedings of the IEEE/CVF International Conference on Computer
Vision, 496–501.
Portilla, J., and E. P. Simoncelli. 2000. “A Parametric Texture
Model Based on Joint Statistics of Complex Wavelet Coefficients.”
International Journal of Computer Vision 40 (1): 49–71.
Prince, S. J. D. 2012. Computer Vision: Models Learning and
Inference. Cambridge University Press.
Radford, Alec, Jong Wook Kim, Chris Hallacy, Aditya Ramesh, Gabriel Goh,
Sandhini Agarwal, Girish Sastry, et al. 2021. “Learning
Transferable Visual Models from Natural Language Supervision.” In
Icml, 8748–63.
Ramaswamy, Vikram V., William T. Freeman, Fei-Fei Li, Pietro Perona,
Antonio Torralba, and Olga Russakovsky. 2021. “The Future of
Computer Vision Datasets.” Computer Vision and Pattern
Recognition Workshop.
Ramaswamy, Vikram V., Sunnie S. Y. Kim, and Olga Russakovsky. 2021.
“Fair Attribute Classification Through Latent Space
de-Biasing.” In Proceedings of the IEEE/CVF Computer Vision
and Pattern Recognition.
Ramesh, Aditya, Prafulla Dhariwal, Alex Nichol, Casey Chu, and Mark
Chen. 2022. “Hierarchical Text-Conditional Image Generation with
Clip Latents.” https://arxiv.org/abs/2204.06125.
Ramesh, Aditya, Mikhail Pavlov, Gabriel Goh, Scott Gray, Chelsea Voss,
Alec Radford, Mark Chen, and Ilya Sutskever. 2021. “Zero-Shot
Text-to-Image Generation.” In Icml, 8821–31.
Ramón y Cajal, S. 1893. “La Rétine Des Vertébrés.”
Cellule 9: 119–255.
Ranftl, René, Alexey Bochkovskiy, and Vladlen Koltun. 2021.
“Vision Transformers for Dense Prediction.” In
Iccv.
Ranftl, René, Katrin Lasinger, David Hafner, Konrad Schindler, and
Vladlen Koltun. 2022. “Towards Robust Monocular Depth Estimation:
Mixing Datasets for Zero-Shot Cross-Dataset Transfer.”
Pami 44 (3).
Recasens, Adrià, Pauline Luc, Jean-Baptiste Alayrac, Luyu Wang, Florian
Strub, Corentin Tallec, Mateusz Malinowski, et al. 2021. “Broaden
Your Views for Self-Supervised Video Learning.” Iccv.
Redmon, J., S. Divvala, R. Girshick, and A. Farhadi. 2016. “You
Only Look Once: Unified, Real-Time Object Detection.” In
Cvpr, 779–88.
Ren, Shaoqing, Kaiming He, Ross B. Girshick, and Jian Sun. 2015.
“Faster r-CNN: Towards Real-Time Object Detection with Region
Proposal Networks.” In Nips, 91–99.
Rescorla, Robert A. 1972. “A Theory of Pavlovian Conditioning:
Variations in the Effectiveness of Reinforcement and
Non-Reinforcement.” Classical Conditioning, Current Research
and Theory 2: 64–69.
Roberts, Lawrence G. 1963. Machine Perception of Three-Dimensional
Solids. Outstanding Dissertations in the Computer Sciences. New
York: Garland Publishing.
Rodríguez-Muñoz, Adrián, and Antonio Torralba. 2022. “Aliasing Is
a Driver of Adversarial Attacks.” https://arxiv.org/abs/2212.11760.
Rogaway, P. 2015. “The Moral Character of Cryptographic
Work?” International Association for Cryptologic
Research.
Rombach, Robin, Andreas Blattmann, Dominik Lorenz, Patrick Esser, and
Björn Ommer. 2022. “High-Resolution Image Synthesis with Latent
Diffusion Models.” In Cvpr, 10684–95.
Ronneberger, Olaf, Philipp Fischer, and Thomas Brox. 2015. “U-Net:
Convolutional Networks for Biomedical Image Segmentation.” In
International Conference on Medical Image Computing and
Computer-Assisted Intervention, 234–41. Springer.
Rosch, Eleanor. 1978. Principles of Categorization. De Gruyter
Mouton.
Rosch, Eleanor, Carolyn B. Mervis, Wayne D. Gray, D M Johnson, and Penny
Boyes-Braem. 1976. “Basic Objects in Natural Categories.”
Cognitive Psychology 8: 382–439.
Rosenblatt, Frank. 1958. “The Perceptron: A Probabilistic Model
for Information Storage and Organization in the Brain.”
Psychological Review 65 (6): 386.
Rowley, Henry, Shumeet Baluja, and Takeo Kanade. 1996. “Neural
Network-Based Face Detection.”
Rublee, Ethan, Vincent Rabaud, Kurt Konolige, and Gary Bradski. 2011.
“ORB: An Efficient Alternative to SIFT or SURF.” In
Iccv, 2564–71.
Ruderman, D. L. 1997. “Origins of Scaling in Natural
Images.” Vision Research 37 (23): 3385–98.
Rumelhart, D. E., and J. L. McClelland, eds. 1986. Parallel
Distributed Processing. Cambridge, MA: MIT Press.
Rumelhart, David E., Geoffrey E. Hinton, and Ronald J. Williams. 1985.
“Learning Internal Representations by Error Propagation.”
California Univ San Diego Inst for Cognitive Science.
Russakovsky, Olga, Jia Deng, Hao Su, Jonathan Krause, Sanjeev Satheesh,
Sean Ma, Zhiheng Huang, et al. 2015. “Imagenet Large Scale Visual
Recognition Challenge.” Ijcv 115 (3): 211–52.
Russell, B. C., A. Torralba, K. P. Murphy, and W. T. Freeman. 2008.
“LabelMe: A Database and Web-Based Tool for Image
Annotation.” International Journal of Computer Vision
77: 157–73.
Salimans, Tim, Jonathan Ho, Xi Chen, Szymon Sidor, and Ilya Sutskever.
2017. “Evolution Strategies as a Scalable Alternative to
Reinforcement Learning.” https://arxiv.org/abs/1703.03864.
Sattigeri, Prasanna, Samuel C. Hoffman, Vijil Chenthamarakshan, and Kush
R. Varshney. 2019. “Fairness GAN: Generating Datasets with
Fairness Properties Using a Generative Adversarial Network.” In
Intl. Conf. On Learning Representations (ICLR) Workshop.
Savinov, N., A. Seki, L. Ladicky, T. Sattler, and M. Pollefeys. 2017.
“Quad-Networks: Unsupervised Learning to Rank for Interest Point
Detection.” In Cvpr, 3929–37.
Saxena, Ashutosh, Sung H. Chung, and Andrew Y. Ng. 2008. “3-d
Depth Reconstruction from a Single Still Image.” Ijcv 76
(1): 53–69.
Scharstein, D., and R. Szeliski. 2002. “A Taxonomy and Evaluation
of Dense Two-Frame Stereo Correspondence Algorithms.”
International Journal of Computer Vision 47.
Schmid, Cordelia, Roger Mohr, and Christian Bauckhage. 2000.
“Evaluation of Interest Point Detectors.” Ijcv 37
(2): 151–72.
Schölkopf, Bernhard, Alexander Smola, and Klaus-Robert Müller. 1998.
“Nonlinear Component Analysis as a Kernel Eigenvalue
Problem.” Neural Computation 10 (5): 1299–319.
Schrimpf, Martin, Jonas Kubilius, Michael J Lee, N Apurva Ratan Murty,
Robert Ajemian, and James J DiCarlo. 2020. “Integrative
Benchmarking to Advance Neurally Mechanistic Models of Human
Intelligence.” Neuron.
Shannon, C. E. 1948. “A Mathematical Theory of
Communication.” The Bell System Technical Journal 27
(3): 379–423.
Shapin, S. 2019. “A Theorist of (Not Quite) Everything.”
The New York Review of Books.
Shepard, Roger N. 1990. Mind Sights: Original Visual Illusions,
Ambiguities, and Other Anomalies, with a Commentary on the Play of Mind
in Perception and Art. New York: W.H. Freeman; Co.
Shepard, Roger N., and Jacqueline Metzler. 1971. “Mental Rotation
of Three-Dimensional Objects.” Science 171 (3972):
701–3.
Sherrington, C S. 1906. “Observations on the Scratch-Reflex in the
Spinal Dog.” The Journal of Physiology 34 (1-2): 1–50.
Shi, Jianbo, and Jitendra Malik. 2000. “Normalized Cuts and Image
Segmentation.” Pami 22 (8): 888–905.
Shi, Jianbo, and Tomasi. 1994. “Good Features to Track.” In
Cvpr, 593–600.
Shi, J., and J. Malik. 2000. “Normalized Cuts and Image
Segmentation.” IEEE Transactions on Pattern Analysis and
Machine Intelligence 22 (8): 888–905.
Shirley, Peter, Michael Ashikhmin, and Steve Marschner. 2009.
Fundamentals of Computer Graphics. AK Peters/CRC Press.
Silver, David, Aja Huang, Chris J Maddison, Arthur Guez, Laurent Sifre,
George Van Den Driessche, Julian Schrittwieser, et al. 2016.
“Mastering the Game of Go Ith Deep Neural Networks and Tree
Search.” Nature 529 (7587): 484–89.
Simoncelli, E. P. 2005. “Statistical Modeling of Photographic
Images.” In Handbook of Image and Video Processing,
431–41. Academic Press.
Simoncelli, E. P., and E. H. Adelson. 1996. “Noise Removal via
Bayesian Wavelet Coring.” In International
Conference on Image Processing, 379–82.
Simoncelli, E. P., and W. T. Freeman. 1995. “The Steerable
Pyramid: A Flexible Architecture for Multi-Scale Derivative
Computation.” In International Conference on Image
Processing.
Simoncelli, E. P., W. T. Freeman, E. H. Adelson, and D. J. Heeger. 1992.
“Shiftable Multi-Scale Transforms.” IEEE Transactions
on Information Theory 2 (38): 587–607.
Simoncelli, Eero P., and Edward H. Adelson. 1990. “Subband Image
Coding with Hexagonal Quadrature Mirror Filters.” In Picture
Coding Symposium.
Simonyan, Karen, and Andrew Zisserman. 2015. “Very Deep
Convolutional Networks for Large-Scale Image Recognition.” In
Iclr.
Sitzmann, Vincent, Julien N. P. Martel, Alexander W. Bergman, David B.
Lindell, and Gordon Wetzstein. 2020. “Implicit Neural
Representations with Periodic Activation Functions.” In
Nips.
Smith, A. M. 2001. Alhacen’s Theory of Visual Perception: A Critical
Edition, with English Translation and Commentary, of the First Three
Books of Alhacen’s de Aspectibus, the Medieval Latin Version of Ibn
Al-Haytham’s Kitab Al-Manazir. v. 91, pt. 4. American Philosophical
Society.
Snell, Jake, Karl Ridgeway, Renjie Liao, Brett D Roads, Michael C Mozer,
and Richard S Zemel. 2017. “Learning to Generate Images with
Perceptual Similarity Metrics.” In Icip, 4277–81. IEEE.
Soatto, Stefano. 2013. “Actionable Information in Vision.”
In Machine Learning for Computer Vision, 17–48. Springer.
Sohl-Dickstein, Jascha, Eric Weiss, Niru Maheswaranathan, and Surya
Ganguli. 2015. “Deep Unsupervised Learning Using Nonequilibrium
Thermodynamics.” In Icml, 2256–65.
Sperling, George, Son-Hee Lyu, Chia-Huei Tseng, and Zhong-Lin Lu. 2017.
“The Motion Standstill Illusion.” In The Oxford Compendium of Visual Illusions.
Oxford University Press.
Spillmann, Lothar. 2014. “Receptive Fields of Visual Neurons: The
Early Years.” Perception 43: 1145–76.
Srivastava, Rupesh Kumar, Klaus Greff, and Jürgen Schmidhuber. 2015.
“Highway Networks.” https://arxiv.org/abs/1505.00387.
Steinman, Robert M, Zygmunt Pizlo, and Filip J Pizlo. 2000. “Phi
Is Not Beta, and Why Wertheimer’s Discovery Launched the Gestalt
Revolution.” Vision Research 40 (17): 2257–64.
Strunk, William, and E. B. White. 1999. The Elements of Style.
Boston: Allyn; Bacon.
Sturm, Peter, and Bill Triggs. 1996. “A Factorization Based
Algorithm for Multi-Image Projective Structure and Motion.” In
Eccv, 709–20.
Surís, Dídac, Sachit Menon, and Carl Vondrick. 2023. “ViperGPT:
Visual Inference via Python Execution for Reasoning.” In
Iccv.
Sutskever, Ilya. n.d. https://twitter.com/ilyasut/status/1114658175272095744?s=20.
Sutton, Richard S, and Andrew G Barto. 2018. Reinforcement Learning:
An Introduction. Cambridge, MA: MIT press.
Szegedy, Christian, Wojciech Zaremba, Ilya Sutskever, Joan Bruna,
Dumitru Erhan, Ian Goodfellow, and Rob Fergus. 2014. “Intriguing
Properties of Neural Networks.” In Iclr.
Szeliski, Richard. 2022. Computer Vision Algorithms and
Applications. 2nd ed. Springer.
Tancik, Matthew, Pratul Srinivasan, Ben Mildenhall, Sara Fridovich-Keil,
Nithin Raghavan, Utkarsh Singhal, Ravi Ramamoorthi, Jonathan Barron, and
Ren Ng. 2020. “Fourier Features Let Networks Learn High Frequency
Functions in Low Dimensional Domains.” In Nips,
33:7537–47.
Tarr, Michael J., and Steven Pinker. 1989. “Mental Rotation and
Orientation-Dependence in Shape Recognition.” Cognitive
Psychology 21: 233–82.
Telgarsky, Matus. 2016. “Benefits of Depth in Neural
Networks.” In Conference on Learning Theory, 1517–39.
PMLR.
Thomson, Judith Jarvis. 1985. “The Trolley Problem.”
The Yale Law Journal 94 (6): 1395–415.
Tieu, K., and P. Viola. 2000. “Boosting Image Retrieval.”
In Proceedings of the IEEE/CVF Computer Vision and Pattern
Recognition.
Tomasi, C., and T. Kanade. 1992. “Shape and Motion from Image
Streams Under Orthography: A Factorization Method.”
International Journal of Computer Vision 9 (2): 137–54.
Torralba, A., and A. Efros. 2011. “Unbiased Look at Dataset
Bias.” In Proceedings of the IEEE/CVF Computer Vision and
Pattern Recognition.
Torralba, Antonio. 2009. “How Many Pixels Make an Image?”
Visual Neuroscience 26 (1): 123–31.
Torralba, Antonio, Rob Fergus, and William T. Freeman. 2008. “80
Million Tiny Images: A Large Data Set for Nonparametric Object and Scene
Recognition.” Pami 30 (11): 1958–70.
Torralba, Antonio, and William T. Freeman. 2014. “Accidental
Pinhole and Pinspeck Cameras.” Ijcv 110 (2): 92–112.
Traer, James, and Josh H. McDermott. 2016. “Statistics of Natural
Reverberation Enable Perceptual Separation of Sound and Space.”
Proceedings of the National Academy of Sciences 113 (48):
E7856–65.
Treisman, A M, and G Gelade. 1980. “A Feature-Integration Theory
of Attention.” Cognit Psychol 12 (1): 97–136.
Trucco, Emanuele, and Alessandro Verri. 1998. Introductory
Techniques for 3-d Computer Vision. USA: Prentice Hall PTR.
Turing, Alan M. 2009. Computing Machinery and Intelligence.
Springer.
Tyleček, Radim, and Radim Šára. 2013. “Spatial Pattern Templates
for Recognition of Objects with Regular Structure.” In
Proceedings German Conference on Pattern Recognition.
Saarbrucken, Germany.
Tzeng, Eric, Judy Hoffman, Kate Saenko, and Trevor Darrell. 2017.
“Adversarial Discriminative Domain Adaptation.” In
Cvpr, 7167–76.
Uijlings, Jasper R. R., Koen E. A. van de Sande, Theo Gevers, and Arnold
W. M. Smeulders. 2013. “Selective Search for Object
Recognition.” Ijcv 104 (2): 154–71.
Ullman, S., and Sydney Brenner. 1979. “The Interpretation of
Structure from Motion.” Proceedings of the Royal Society of
London. Series B. Biological Sciences 203 (1153): 405–26.
Ullman, Shimon. 2000. High-Level Vision. Cambridge, MA: MIT
Press.
van der Schaaf, A., and J. H. van Hateren. 1996. “Modelling the
Power Spectra of Natural Images: Statistics and Information.”
Vision Research 36 (17): 2759–70.
Vanessaezekowitz. 1993. “Eye Cone Responses.” https://commons.wikimedia.org/wiki/File:Cones_SMJ2_E.svg.
Vaswani, Ashish, Noam Shazeer, Niki Parmar, Jakob Uszkoreit, Llion
Jones, Aidan N Gomez, Łukasz Kaiser, and Illia Polosukhin. 2017.
“Attention Is All You Need.” In Nips, 5998–6008.
Vincent, Pascal, Hugo Larochelle, Yoshua Bengio, and Pierre-Antoine
Manzagol. 2008. “Extracting and Composing Robust Features with
Denoising Autoencoders.” In Icml, 1096–1103.
Viola, P., and M. Jones. 2001. “Rapid Object Detection Using a
Boosted Cascade of Simple Classifiers.” In Proceedings of the
IEEE/CVF Computer Vision and Pattern Recognition.
Waltz, David L. 1972. “Generating Semantic Descriptions from
Drawings of Scenes with Shadows.” PhD Thesis, Artificial
Intelligence Lab Memo. Massachusetts Institute of Technology.
Wandell, Brian. 1995. Foundations of Vision. Sinauer Assoc.
Wang, Dequan, Evan Shelhamer, Shaoteng Liu, Bruno Olshausen, and Trevor
Darrell. 2020. “Fully Test-Time Adaptation by Entropy
Minimization.” https://arxiv.org/abs/2006.10726.
Wang, J. Y. A., and E. H. Adelson. 1994. “Representing Moving
Images with Layers.” IEEE Transactions on Image
Processing 3 (5): 625–38.
Wang, Tongzhou, and Phillip Isola. 2020. “Understanding
Contrastive Representation Learning Through Alignment and Uniformity on
the Hypersphere.” In Icml, 9929–39.
Wang, Zeyu, Klint Qinami, Ioannis Christos Karakozis, Kyle Genova, Prem
Nair, Kenji Hata, and Olga Russakovsky. 2020. “Towards Fairness in
Visual Recognition: Effective Strategies for Bias Mitigation.” In
Proceedings of the IEEE/CVF Computer Vision and Pattern
Recognition.
Weber, M., M. Welling, and P. Perona. 2000. “Towards Automatic
Discovery of Object Categories.” In Proceedings of the
IEEE/CVF Computer Vision and Pattern Recognition, 2:101–8.
Wei, Xi, Tianzhu Zhang, Yan Li, Yongdong Zhang, and Feng Wu. 2020.
“Multi-Modality Cross Attention Network for Image and Sentence
Matching.” In Cvpr, 10941–50.
Weiss, Yair. 2001. “Deriving Intrinsic Images from Image
Sequences.” In Iccv, 2:68–75.
Weiss, Yair, and Edward H. Adelson. 1998. “Slow and Smooth: A
Bayesian Theory for the Combination of Local Motion Signals in Human
Vision.” MIT.
Wertheimer, Max. 1912. “Experimentelle Studien Uber Das Sehen von
Bewegung.” Zeitschrift Fur Psychologie 61.
Wiesel, T. N. 1982. “The Postnatal Development of the Visual
Cortex and the Influence of Environment.” Nature 299:
583–91.
Wikipedia. 2021b. https://en.wikipedia.org/wiki/Image_rectification.
———. 2021a. https://en.wikipedia.org/wiki/Hermann_von_Helmholtz.
Williams, Lance. 1983. “Pyramidal Parametrics.” In
Siggraph, 1–11.
Winston, Patrick. 2016. “How to Speak.” https://vimeo.com/101543862.
Witkin, A. P. 1981. “Recovering Surface Shape and Orientation from
Texture.” Artificial Intelligence 17: 17–45.
Wolfe, Jeremy M. 2000. “Visual Attention.” Seeing,
335–86.
———. 2007. “Guided Search 4.0: Current Progress with a Model of
Visual Search.” In Integrated Models of
Cognitive Systems. Oxford University Press.
Wu, Chenfei, Shengming Yin, Weizhen Qi, Xiaodong Wang, Zecheng Tang, and
Nan Duan. 2023. “Visual ChatGPT: Talking, Drawing and Editing with
Visual Foundation Models.” https://arxiv.org/abs/2303.04671.
Wu, Fa-Yueh. 1982. “The Potts Model.” Rev. Mod.
Phys. 54 (1): 235–68.
Xian, Ke, Chunhua Shen, Zhiguo Cao, Hao Lu, Yang Xiao, Ruibo Li, and
Zhenbo Luo. 2018. “Monocular Relative Depth Perception with Web
Stereo Data Supervision.” In Cvpr, 311–20.
Yedidia, J. S., W. T. Freeman, and Y. Weiss. 2001. “Generalized
Belief Propagation.” In Nips, 13:689–95.
Yi, Zili, Hao Zhang, Ping Tan, and Minglun Gong. 2017. “Dualgan:
Unsupervised Dual Learning for Image-to-Image Translation.” In
Iccv, 2849–57.
Zabih, R., and V. Komogorov. 2004. “What Energy Functions Can Be
Minimized via Graph Cuts?” In European Conf. Computer
Vision, 26:147–59.
Zbontar, J., and Y. LeCun. 2015. “Computing the Stereo Matching
Cost with a Convolutional Neural Network.” In Proceedings of
the IEEE/CVF Computer Vision and Pattern Recognition, 1592–99.
Zhang, Chiyuan, Samy Bengio, Moritz Hardt, Benjamin Recht, and Oriol
Vinyals. 2016. “Understanding Deep Learning Requires Rethinking
Generalization.” https://arxiv.org/abs/1611.03530.
Zhang, Feihu, Victor Prisacariu, Ruigang Yang, and Philip HS Torr. 2019.
“GA-Net: Guided Aggregation Net for End-to-End Stereo
Matching.” In Proceedings of the IEEE/CVF Computer Vision and
Pattern Recognition, 185–94.
Zhang, Feihu, Xiaojuan Qi, Ruigang Yang, Victor Prisacariu, Benjamin
Wah, and Philip Torr. 2019. “Domain-Invariant Stereo Matching
Networks.” In European Conference on Computer Vision.
Zhang, L., B. Curless, A. Hertzmann, and S. M. Seitz. 2003. “Shape
and Motion Under Varying Illumination: Unifying Structure from Motion,
Photometric Stereo, and Multi-View Stereo.” In Proceedings of
the IEEE/CVF International Conference on Computer Vision.
Zhang, Lvmin, Anyi Rao, and Maneesh Agrawala. 2023. “Adding
Conditional Control to Text-to-Image Diffusion Models.” In
Iccv.
Zhang, Richard, Phillip Isola, and Alexei A Efros. 2016. “Colorful
Image Colorization.” In Eccv, 649–66. Springer.
———. 2017. “Split-Brain Autoencoders: Unsupervised Learning by
Cross-Channel Prediction.” In Cvpr, 1058–67.
Zhang, Z. 2000. “A Flexible New Technique for Camera
Calibration.” Pami 22 (11): 1330–34.
Zhao, Jieyu, Tianlu Wang, Mark Yatskar, Vicente Ordonez, and Kai-Wei
Chang. 2017. “Men Also Like Shopping: Reducing Gender Bias
Amplification Using Corpus-Level Constraints.” In Proceedings
of the Conference on Empirical Methods in Natural Language Processing
(EMNLP).
Zhou, Bolei, Aditya Khosla, Agata Lapedriza, Aude Oliva, and Antonio
Torralba. 2015. “Object Detectors Emerge in Deep Scene
CNNs.” Iclr.
Zhou, Tinghui, Matthew Brown, Noah Snavely, and David G. Lowe. 2017.
“Unsupervised Learning of Depth and Ego-Motion from Video.”
In Cvpr, 6612–19.
Zhou, Yichao, Haozhi Qi, Jingwei Huang, and Yi Ma. 2019.
“NeurVPS: Neural Vanishing Point Scanning via Conic
Convolution.” In Nips.
Zhu, Jun-Yan, Taesung Park, Phillip Isola, and Alexei A Efros. 2017.
“Unpaired Image-to-Image Translation Using Cycle-Consistent
Adversarial Networks.” In Iccv.