Misplaced Pages

Theoretical computer science: Difference between revisions

Article snapshot taken from[REDACTED] with creative commons attribution-sharealike license. Give it a read and then ask your questions in the chat. We can research this topic together.
Browse history interactively← Previous editNext edit →Content deleted Content addedVisualWikitext
Revision as of 05:38, 7 October 2014 editAndywear1 (talk | contribs)68 editsNo edit summary← Previous edit Revision as of 03:30, 22 November 2014 edit undoBrirush (talk | contribs)Extended confirmed users4,123 edits Massive edit expanding into summary style. Each summary is composed of material taken directly from the corresponding page.Next edit →
Line 3: Line 3:
'''Theoretical computer science''' is a division or subset of general ] and ] which focuses on more abstract or mathematical aspects of computing and includes the ]. '''Theoretical computer science''' is a division or subset of general ] and ] which focuses on more abstract or mathematical aspects of computing and includes the ].


== Scope ==
It is not easy to circumscribe the theory areas precisely and the ]'s ] (SIGACT) describes its mission as the promotion of theoretical computer science and notes:<ref>{{cite web | title = SIGACT | url = http://sigact.acm.org | accessdate = 2009-03-29}}</ref> It is not easy to circumscribe the theory areas precisely and the ]'s ] (SIGACT) describes its mission as the promotion of theoretical computer science and notes:<ref>{{cite web | title = SIGACT | url = http://sigact.acm.org | accessdate = 2009-03-29}}</ref>


Line 9: Line 8:


To this list, the ACM's journal Transactions on Computation Theory adds ], ] and theoretical computer science aspects of areas such as ], ], economic models and ].<ref>{{cite web | title = ToCT| url = http://toct.acm.org/journal.html | accessdate = 2010-06-09}}</ref> Despite this broad scope, the "theory people" in computer science self-identify as different from the "applied people." Some characterize themselves as doing the "(more fundamental) 'science(s)' underlying the field of computing."<ref>{{cite web | title = Challenges for Theoretical Computer Science: Theory as the Scientific Foundation of Computing | url = http://www.research.att.com/%7Edsj/nsflist.html#Intro | accessdate = 2009-03-29}}</ref> Other "theory-applied people" suggest that it is impossible to separate theory and application. This means that the so-called "theory people" regularly use experimental science(s) done in less-theoretical areas such as ] research. It also means that there is more cooperation than mutually exclusive competition between theory and application. To this list, the ACM's journal Transactions on Computation Theory adds ], ] and theoretical computer science aspects of areas such as ], ], economic models and ].<ref>{{cite web | title = ToCT| url = http://toct.acm.org/journal.html | accessdate = 2010-06-09}}</ref> Despite this broad scope, the "theory people" in computer science self-identify as different from the "applied people." Some characterize themselves as doing the "(more fundamental) 'science(s)' underlying the field of computing."<ref>{{cite web | title = Challenges for Theoretical Computer Science: Theory as the Scientific Foundation of Computing | url = http://www.research.att.com/%7Edsj/nsflist.html#Intro | accessdate = 2009-03-29}}</ref> Other "theory-applied people" suggest that it is impossible to separate theory and application. This means that the so-called "theory people" regularly use experimental science(s) done in less-theoretical areas such as ] research. It also means that there is more cooperation than mutually exclusive competition between theory and application.

{| style="border:1px solid #ddd; text-align:center; margin: 0 auto;" cellspacing="15"
| <math> P \rightarrow Q \,</math>
| ]
| ]
| ]
| ]
| '''P = NP''' ?
|-
| ]
| ]
| ]
| ]
| ]
| ]
|-
| '''GNITIRW-TERCES'''
| <math>\Gamma\vdash x : Int</math>
| ]
| ]
| ]
| ]
|-
| ]
| ]
| ]
| ]
| ]
| ]
|}


== History == == History ==
Line 50: Line 19:


Modern theoretical computer science research is based on these basic developments, but includes many other mathematical and interdisciplinary problems that have been posed. Modern theoretical computer science research is based on these basic developments, but includes many other mathematical and interdisciplinary problems that have been posed.

== Topics ==

===Algorithms===
{{main|Algorithm}}
An ] is a step-by-step procedure for calculations. Algorithms are used for ], ], and ].

An algorithm is an ] expressed as a ] list<ref>"Any classical mathematical algorithm, for example, can be described in a finite number of English words" (Rogers 1987:2).</ref> of well-defined instructions<ref>Well defined with respect to the agent that executes the algorithm: "There is a computing agent, usually human, which can react to the instructions and carry out the computations" (Rogers 1987:2).</ref> for calculating a ].<ref>"an algorithm is a procedure for computing a ''function'' (with respect to some chosen notation for integers) ... this limitation (to numerical functions) results in no loss of generality", (Rogers 1987:1).</ref> Starting from an initial state and initial input (perhaps ]),<ref>"An algorithm has ] or more inputs, i.e., ] which are given to it initially before the algorithm begins" (Knuth 1973:5).</ref> the instructions describe a ] that, when ], proceeds through a finite<ref>"A procedure which has all the characteristics of an algorithm except that it possibly lacks finiteness may be called a 'computational method'" (Knuth 1973:5).</ref> number of well-defined successive states, eventually producing "output"<ref>"An algorithm has one or more outputs, i.e. quantities which have a specified relation to the inputs" (Knuth 1973:5).</ref> and terminating at a final ending state. The transition from one state to the next is not necessarily ]; some algorithms, known as ], incorporate random input.<ref>Whether or not a process with random interior processes (not including the input) is an algorithm is debatable. Rogers opines that: "a computation is carried out in a discrete stepwise fashion, without use of continuous methods or analogue devices . . . carried forward deterministically, without resort to random methods or devices, e.g., dice" Rogers 1987:2.</ref>

===Data structures===
{{main|Data structure}}
A ] is a particular way of organizing ] in a computer so that it can be used ].<ref>Paul E. Black (ed.), entry for ''data structure'' in '']. U.S. ]. 15 December 2004. Accessed May 21, 2009.''</ref><ref>Entry ''data structure'' in the ] (2009) accessed on May 21, 2009.</ref>

Different kinds of data structures are suited to different kinds of applications, and some are highly specialized to specific tasks. For example, databases use ] indexes for small percentages of data retrieval and ]s and databases use dynamic ]s as look up tables.

Data structures provide a means to manage large amounts of data efficiently for uses such as large ]s and ]. Usually, efficient data structures are key to designing efficient ]s. Some formal design methods and ]s emphasize data structures, rather than algorithms, as the key organizing factor in software design. Storing and retrieving can be carried out on data stored in both ] and in ].

===Computational complexity theory===
{{main|Computational complexity theory}}
] is a branch of the ] that focuses on classifying ] according to their inherent difficulty, and relating those ] to each other. A computational problem is understood to be a task that is in principle amenable to being solved by a computer, which is equivalent to stating that the problem may be solved by mechanical application of mathematical steps, such as an ].

A problem is regarded as inherently difficult if its solution requires significant resources, whatever the ] used. The theory formalizes this intuition, by introducing mathematical ] to study these problems and quantifying the amount of resources needed to solve them, such as time and storage. Other ] measures are also used, such as the amount of communication (used in ]), the number of ] in a circuit (used in ]) and the number of processors (used in ]). One of the roles of computational complexity theory is to determine the practical limits on what ]s can and cannot do.

===Distributed computation===
{{main|Distributed computation}}
] studies distributed systems. A distributed system is a software system in which components located on ] communicate and coordinate their actions by ].<ref name="Coulouris">{{cite book|last=Coulouris|first=George|author2=Jean Dollimore|author3=Tim Kindberg|author4=Gordon Blair|title=Distributed Systems: Concepts and Design (5th Edition)|publisher = Addison-Wesley|year=2011|location=Boston|isbn=0-132-14301-1}}</ref> The components interact with each other in order to achieve a common goal. Three significant characteristics of distributed systems are: concurrency of components, lack of a global clock, and independent failure of components.<ref name="Coulouris"/> Examples of distributed systems vary from ] to ]s to ].

A ] that runs in a distributed system is called a '''distributed program''', and distributed programming is the process of writing such programs.<ref>{{harvtxt|Andrews|2000}}. {{harvtxt|Dolev|2000}}. {{harvtxt|Ghosh|2007}}, p. 10.</ref> There are many alternatives for the message passing mechanism, including ] connectors and ]. An important goal and challenge of distributed systems is ].

===Parallel computation===
{{main|Parallel computation}}
] is a form of ] in which many calculations are carried out simultaneously,<ref>{{cite book|last=Gottlieb|first=Allan|title=Highly parallel computing|year=1989|publisher=Benjamin/Cummings|location=Redwood City, Calif.|isbn=0-8053-0177-1|url=http://dl.acm.org/citation.cfm?id=160438|author2=Almasi, George S.}}</ref> operating on the principle that large problems can often be divided into smaller ones, which are then solved ] ("in parallel"). There are several different forms of parallel computing: ], ], ], and ]. Parallelism has been employed for many years, mainly in ], but interest in it has grown lately due to the physical constraints preventing ].<ref>S.V. Adve et al. (November 2008). (PDF). Parallel@Illinois, University of Illinois at Urbana-Champaign. "The main techniques for these performance benefits&nbsp;– increased clock frequency and smarter but increasingly complex architectures&nbsp;– are now hitting the so-called power wall. The computer industry has accepted that future performance increases must largely come from increasing the number of processors (or cores) on a die, rather than making a single core go faster."</ref> As power consumption (and consequently heat generation) by computers has become a concern in recent years,<ref>Asanovic et al. Old : Power is free, but transistors are expensive. New is power is expensive, but transistors are "free".</ref> parallel computing has become the dominant paradigm in ], mainly in the form of ]s.<ref name="View-Power">Asanovic, Krste et al. (December 18, 2006). (PDF). University of California, Berkeley. Technical Report No. UCB/EECS-2006-183. "Old : Increasing clock frequency is the primary method of improving processor performance. New : Increasing parallelism is the primary method of improving processor performance&nbsp;... Even representatives from Intel, a company generally associated with the 'higher clock-speed is better' position, warned that traditional approaches to maximizing performance through maximizing clock speed have been pushed to their limit."</ref>

] are more difficult to write than sequential ones,<ref>{{cite book|last=Hennessy|first=John L.|title=Computer organization and design : the hardware/software interface|year=1999|publisher=Kaufmann|location=San Francisco|isbn=1-55860-428-6|edition=2. ed., 3rd print.|author2=Patterson, David A. |author3=Larus, James R. }}</ref> because concurrency introduces several new classes of potential ]s, of which ]s are the most common. ] and ] between the different subtasks are typically some of the greatest obstacles to getting good parallel program performance.

The maximum possible ] of a single program as a result of parallelization is known as ].

===Very-large-scale integration===
{{main|VLSI}}
] ('''VLSI''') is the process of creating an ] (IC) by combining thousands of ] into a single chip. VLSI began in the 1970s when complex ] and ] technologies were being developed. The ] is a VLSI device. Before the introduction of VLSI technology most ICs had a limited set of functions they could perform. An ] might consist of a ], ], ] and other ]. VLSI lets IC makers add all of these into one chip.

===Machine learning===
{{main|Machine learning}}
] is a ] that deals with the construction and study of ]s that can ] from data.<ref>{{cite journal |title=Glossary of terms |author1=Ron Kovahi |author2=Foster Provost |journal=] |volume=30 |pages=271–274 |year=1998 |url=http://ai.stanford.edu/~ronnyk/glossary.html}}</ref> Such algorithms operate by building a ] based on inputs<ref name="bishop">{{cite book |author=C. M. Bishop |authorlink=Christopher M. Bishop |year=2006 |title=Pattern Recognition and Machine Learning |publisher=Springer |isbn=0-387-31073-8}}</ref>{{rp|2}} and using that to make predictions or decisions, rather than following only explicitly programmed instructions.

Machine learning can be considered a subfield of computer science and ]. It has strong ties to ] and ], which deliver methods, theory and application domains to the field. Machine learning is employed in a range of computing tasks where designing and programming explicit, rule-based ]s is infeasible. Example applications include ]ing, ] (OCR),<ref name=Wernick-Signal-Proc-July-2010>Wernick, Yang, Brankov, Yourganov and Strother, Machine Learning in Medical Imaging, '']'', vol. 27, no. 4, July 2010, pp. 25-38</ref> ] and ]. Machine learning is sometimes conflated with ],<ref>{{cite conference |last=Mannila |first=Heikki |title=Data mining: machine learning, statistics, and databases |conference=Int'l Conf. Scientific and Statistical Database Management |publisher=IEEE Computer Society |year=1996}}</ref> although that focuses more on exploratory data analysis.<ref>{{cite journal |last=Friedman |first=Jerome H. |authorlink=Jerome H. Friedman |title=Data Mining and Statistics: What's the connection? |journal=Computing Science and Statistics |volume=29 |issue=1 |year=1998 |pages=3–9}}</ref> Machine learning and ] "can be viewed as two facets of
the same field."<ref name="bishop"/>{{rp|vii}}

===Computational biology===
{{main|Computational biology}}
] involves the development and application of data-analytical and theoretical methods, mathematical modeling and computational simulation techniques to the study of biological, behavioral, and social systems.<ref name="nih">
{{cite web
| url = http://www.bisti.nih.gov/docs/compubiodef.pdf
| title = NIH working definition of bioinformatics and computational biology
| date = 17 July 2000
| accessdate = 18 August 2012
| publisher = Biomedical Information Science and Technology Initiative
}}
</ref> The field is broadly defined and includes foundations in computer science, ], ], ], ], ], ], ], ], ], ], ], ], ], and ].<ref name="brown">
{{cite web
| url = http://www.brown.edu/research/projects/computational-molecular-biology/
| title = About the CCMB
| accessdate = 18 August 2012
| publisher = Center for Computational Molecular Biology
}}
</ref>

Computational biology is different from ], which is a subfield of computer science and ] using ] and ] to build ]s, but is similar to ], which is an interdisciplinary science using computers to store and process biological data.

===Computational geometry===
{{main|Computational geometry}}
] is a branch of computer science devoted to the study of algorithms which can be stated in terms of ]. Some purely geometrical problems arise out of the study of computational geometric algorithms, and such problems are also considered to be part of computational geometry. While modern computational geometry is a recent development, it is one of the oldest fields of computing with history stretching back to antiquity. An ancient precursor is the ] treatise ] , or "Rules of the Chord", that is a book of algorithms written in 800 BCE. The book prescribes step-by-step procedures for constructing geometric objects like altars using a peg and chord.

The main impetus for the development of computational geometry as a discipline was progress in ] and computer-aided design and manufacturing (]/]), but many problems in computational geometry are classical in nature, and may come from ].

Other important applications of computational geometry include ] (motion planning and visibility problems), ]s (GIS) (geometrical location and search, route planning), ] design (IC geometry design and verification), ] (CAE) (mesh generation), ] (3D reconstruction).

===Information theory===
{{main|Information theory}}
] is a branch of ], ], and ] involving the ] of ]. Information theory was developed by ] to find fundamental limits on ] operations such as ] and on reliably ] and ] data. Since its inception it has broadened to find applications in many other areas, including ], ], ], ],<ref>{{cite book|author=F. Rieke, D. Warland, R Ruyter van Steveninck, W Bialek|title=Spikes: Exploring the Neural Code|publisher=The MIT press|year=1997|isbn=978-0262681087}}</ref> the evolution<ref>cf. Huelsenbeck, J. P., F. Ronquist, R. Nielsen and J. P. Bollback (2001) Bayesian inference of phylogeny and its impact on evolutionary biology, ''Science'' '''294''':2310-2314</ref> and function<ref>Rando Allikmets, Wyeth W. Wasserman, Amy Hutchinson, Philip Smallwood, Jeremy Nathans, Peter K. Rogan, , Michael Dean (1998) Organization of the ABCR gene: analysis of promoter and splice junction sequences, ''Gene'' '''215''':1, 111-122</ref> of molecular codes, model selection in ecology,<ref>Burnham, K. P. and Anderson D. R. (2002) ''Model Selection and Multimodel Inference: A Practical Information-Theoretic Approach, Second Edition'' (Springer Science, New York) ISBN 978-0-387-95364-9.</ref> thermal physics,<ref>Jaynes, E. T. (1957) , ''Phys. Rev.'' '''106''':620</ref> ], ], plagiarism detection<ref>Charles H. Bennett, Ming Li, and Bin Ma (2003) , ''Scientific American'' '''288''':6, 76-81</ref>, ], ] and other forms of ].<ref>
{{Cite web
| author = David R. Anderson
| title = Some background on why people in the empirical sciences may want to better understand the information-theoretic methods
| date = November 1, 2003
| url = http://aicanderson2.home.comcast.net/~aicanderson2/home.pdf
| format = pdf
| accessdate = 2010-06-23}}
</ref>

Applications of fundamental topics of information theory include ] (e.g. ]), ] (e.g. ]s and ]s), and ] (e.g. for ]). The field is at the intersection of ], ], ], ], ], and ]. Its impact has been crucial to the success of the ] missions to deep space, the invention of the compact disc, the feasibility of mobile phones, the development of the ], the study of ] and of human perception, the understanding of ]s, and numerous other fields. Important sub-fields of information theory are ], ], ], ], ], and measures of information.

===Cryptography===
{{main|Cryptography}}
] is the practice and study of techniques for ] in the presence of third parties (called ]).<ref name="rivest90">{{cite book|first=Ronald L.|last=Rivest|authorlink=Ron Rivest|editor=J. Van Leeuwen|title=Handbook of Theoretical Computer Science|chapter=Cryptology|volume=1|publisher=Elsevier|year=1990}}</ref> More generally, it is about constructing and analyzing ]s that overcome the influence of adversaries<ref name="modern-crypto">{{Cite book|first1=Mihir|last1=Bellare|first2=Phillip|last2=Rogaway|title=Introduction to Modern Cryptography|chapter=Introduction|page=10|date=21 September 2005}}</ref> and that are related to various aspects in ] such as data ], ], ], and ].<ref name="hac"/> Modern cryptography intersects the disciplines of ], ], and ]. Applications of cryptography include ], ], and ].

Modern cryptography is heavily based on mathematical theory and computer science practice; cryptographic algorithms are designed around ]s, making such algorithms hard to break in practice by any adversary. It is theoretically possible to break such a system, but it is infeasible to do so by any known practical means. These schemes are therefore termed computationally secure; theoretical advances, e.g., improvements in ] algorithms, and faster computing technology require these solutions to be continually adapted. There exist ] schemes that {{not a typo|provably}} cannot be broken even with unlimited computing power—an example is the ]—but these schemes are more difficult to implement than the best theoretically breakable but computationally secure mechanisms.

===Quantum computation===
{{main|Quantum computation}}
A ] is a ] system that makes direct use of ] ], such as ] and ], to perform ] on ].<ref>"" article in '']'' by ] and ]</ref> Quantum computers are different from digital computers based on ]s. Whereas digital computers require data to be encoded into binary digits (]s), each of which is always in one of two definite states (0 or 1), quantum computation uses ] (quantum bits), which can be in ] of states. A theoretical model is the ], also known as the universal quantum computer. Quantum computers share theoretical similarities with ] and ]; one example is the ability to be in more than one state simultaneously. The field of quantum computing was first introduced by ] in 1980<ref name="manin1980vychislimoe">{{cite book| author=Manin, Yu. I.| title=Vychislimoe i nevychislimoe |trans_title=Computable and Noncomputable | year=1980| publisher=Sov.Radio| url=http://publ.lib.ru/ARCHIVES/M/MANIN_Yuriy_Ivanovich/Manin_Yu.I._Vychislimoe_i_nevychislimoe.(1980).%5Bdjv%5D.zip| pages=13–15| language=Russian| accessdate=4 March 2013}}</ref> and ] in 1982.<ref name="Feynman82">{{cite journal |last=Feynman |first=R. P. |title=Simulating physics with computers |journal=] |year=1982 |volume=21 |issue=6 |pages=467–488 |doi=10.1007/BF02650179 }}</ref><ref>{{cite journal |title=Quantum computation |authorlink=David Deutsch |first=David |last=Deutsch |journal=Physics World |date=1992-01-06 }}</ref> A quantum computer with spins as quantum bits was also formulated for use as a quantum ] in 1968.<ref>{{cite book |first=David |last=Finkelstein |chapter=Space-Time Structure in High Energy Interactions |title=Fundamental Interactions at High Energy |editor1-first=T. |editor1-last=Gudehus |editor2-first=G. |editor2-last=Kaiser |location=New York |publisher=Gordon & Breach |year=1968 }}</ref>

{{as of|2014}}, quantum computing is still in its infancy but experiments have been carried out in which quantum computational operations were executed on a very small number of qubits.<ref>{{cite web|url=http://phys.org/news/2013-01-qubit-bodes-future-quantum.html|title=New qubit control bodes well for future of quantum computing|publisher=|accessdate=26 October 2014}}</ref> Both practical and theoretical research continues, and many national governments and military funding agencies support quantum computing research to develop quantum ]s for both civilian and national security purposes, such as ].<ref> for a sense of where the research is heading.</ref>

===Computational number theory===
{{main|Computational number theory}}
], also known as '''algorithmic number theory''', is the study of ]s for performing ] ]s. The best known problem in the field is ].

===Symbolic computation===
{{main|Symbolic computation}}
], also called symbolic computation or algebraic computation is a scientific area that refers to the study and development of ]s and ] for manipulating ] and other ]s. Although, properly speaking, computer algebra should be a subfield of ], they are generally considered as distinct fields because scientific computing is usually based on ] with approximate ]s, while symbolic computation emphasizes ''exact'' computation with expressions containing ]s that have not any given value and are thus manipulated as symbols (therefore the name of ''symbolic computation'').

] applications that perform symbolic calculations are called '']s'', with the term ''system'' alluding to the complexity of the main applications that include, at least, a method to represent mathematical data in a computer, a user programming language (usually different from the language used for the implementation), a dedicated memory manager, a ] for the input/output of mathematical expressions, a large set of ] to perform usual operations, like simplification of expressions, ] using ], ], ], etc.

===Program semantics===
{{main|Program semantics}}
In ], '''semantics''' is the field concerned with the rigorous mathematical study of the meaning of ]s. It does so by evaluating the meaning of ] legal ] defined by a specific programming language, showing the computation involved. In such a case that the evaluation would be of syntactically illegal strings, the result would be non-computation. Semantics describes the processes a computer follows when executing a program in that specific language. This can be shown by describing the relationship between the input and output of a program, or an explanation of how the program will execute on a certain ], hence creating a ].

===Formal methods===
{{main|Formal methods}}
] are a particular kind of ] based techniques for the ], development and ] of ] and ] systems.<ref name="butler">{{cite web|author=R. W. Butler|title=What is Formal Methods?|url=http://shemesh.larc.nasa.gov/fm/fm-what.html|date=2001-08-06|accessdate=2006-11-16}}</ref> The use of formal methods for software and hardware design is motivated by the expectation that, as in other engineering disciplines, performing appropriate mathematical analysis can contribute to the reliability and robustness of a design.<ref>{{cite journal|author=C. Michael Holloway|title=Why Engineers Should Consider Formal Methods|url=http://klabs.org/richcontent/verification/holloway/nasa-97-16dasc-cmh.pdf| publisher=16th Digital Avionics Systems Conference (27–30 October 1997)|accessdate=2006-11-16}}</ref>

Formal methods are best described as the application of a fairly broad variety of theoretical computer science fundamentals, in particular ] calculi, ]s, ], and ], but also ] and ] to problems in software and hardware specification and verification.<ref>Monin, pp.3-4</ref>

===Automata theory===
{{main|Automata theory}}
] is the study of '']s'' and '']'', as well as the computational problems that can be solved using them. It is a theory in theoretical computer science, under ] (a section of ] and also of ]). ''Automata'' comes from the Greek word αὐτόματα meaning "self-acting".

Automata Theory is the study of self-operating virtual machines to help in logical understanding of input and output process, without or with intermediate stage(s) of ] (or any ] / ]).

===Coding theory===
{{main|Coding theory}}
] is the study of the properties of codes and their fitness for a specific application. Codes are used for ], ], ] and more recently also for ]. Codes are studied by various scientific disciplines—such as ], ], ], and ]—for the purpose of designing efficient and reliable ] methods. This typically involves the removal of redundancy and the correction (or detection) of errors in the transmitted data.

===Computational learning theory===
{{main|Computational learning theory}}
Theoretical results in machine learning mainly deal with a type of
inductive learning called supervised learning. In supervised
learning, an algorithm is given samples that are labeled in some
useful way. For example, the samples might be descriptions of
mushrooms, and the labels could be whether or not the mushrooms are
edible. The algorithm takes these previously labeled samples and
uses them to induce a classifier. This classifier is a function that
assigns labels to samples including the samples that have never been
previously seen by the algorithm. The goal of the supervised learning
algorithm is to optimize some measure of performance such as
minimizing the number of mistakes made on new samples.


== Organizations == == Organizations ==

Revision as of 03:30, 22 November 2014

This article is about the branch of computer science and mathematics. For the journal, see Theoretical Computer Science (journal).

Theoretical computer science is a division or subset of general computer science and mathematics which focuses on more abstract or mathematical aspects of computing and includes the theory of computation.

It is not easy to circumscribe the theory areas precisely and the ACM's Special Interest Group on Algorithms and Computation Theory (SIGACT) describes its mission as the promotion of theoretical computer science and notes:

The field of theoretical computer science is interpreted broadly so as to include algorithms, data structures, computational complexity theory, distributed computation, parallel computation, VLSI, machine learning, computational biology, computational geometry, information theory, cryptography, quantum computation, computational number theory and algebra, program semantics and verification, automata theory, and the study of randomness. Work in this field is often distinguished by its emphasis on mathematical technique and rigor.

To this list, the ACM's journal Transactions on Computation Theory adds coding theory, computational learning theory and theoretical computer science aspects of areas such as databases, information retrieval, economic models and networks. Despite this broad scope, the "theory people" in computer science self-identify as different from the "applied people." Some characterize themselves as doing the "(more fundamental) 'science(s)' underlying the field of computing." Other "theory-applied people" suggest that it is impossible to separate theory and application. This means that the so-called "theory people" regularly use experimental science(s) done in less-theoretical areas such as software system research. It also means that there is more cooperation than mutually exclusive competition between theory and application.

History

Main article: History of computer science

While formal algorithms have existed for millennia (Euclid's algorithm for determining the greatest common divisor of two numbers is still used in computation), it was not until 1936 that Alan Turing, Alonzo Church and Stephen Kleene formalized the definition of an algorithm in terms of computation. While binary and logical systems of mathematics had existed before 1703, when Gottfried Leibniz formalized logic with binary values for true and false. While logical inference and mathematical proof had existed in ancient times, in 1931 Kurt Gödel proved with his incompleteness theorem that there were fundamental limitations on what statements could be proved or disproved.

These developments have led to the modern study of logic and computability, and indeed the field of theoretical computer science as a whole. Information theory was added to the field with a 1948 mathematical theory of communication by Claude Shannon. In the same decade, Donald Hebb introduced a mathematical model of learning in the brain. With mounting biological data supporting this hypothesis with some modification, the fields of neural networks and parallel distributed processing were established. In 1971, Stephen Cook and, working independently, Leonid Levin, proved that there exist practically relevant problems that are NP-complete – a landmark result in computational complexity theory.

With the development of quantum mechanics in the beginning of the 20th century came the concept that mathematical operations could be performed on an entire particle wavefunction. In other words, one could compute functions on multiple states simultaneously. This led to the concept of a quantum computer in the latter half of the 20th century that took off in the 1990s when Peter Shor showed that such methods could be used to factor large numbers in polynomial time, which, if implemented, would render most modern public key cryptography systems uselessly insecure.

Modern theoretical computer science research is based on these basic developments, but includes many other mathematical and interdisciplinary problems that have been posed.

Topics

Algorithms

Main article: Algorithm

An algorithm is a step-by-step procedure for calculations. Algorithms are used for calculation, data processing, and automated reasoning.

An algorithm is an effective method expressed as a finite list of well-defined instructions for calculating a function. Starting from an initial state and initial input (perhaps empty), the instructions describe a computation that, when executed, proceeds through a finite number of well-defined successive states, eventually producing "output" and terminating at a final ending state. The transition from one state to the next is not necessarily deterministic; some algorithms, known as randomized algorithms, incorporate random input.

Data structures

Main article: Data structure

A data structure is a particular way of organizing data in a computer so that it can be used efficiently.

Different kinds of data structures are suited to different kinds of applications, and some are highly specialized to specific tasks. For example, databases use B-tree indexes for small percentages of data retrieval and compilers and databases use dynamic hash tables as look up tables.

Data structures provide a means to manage large amounts of data efficiently for uses such as large databases and internet indexing services. Usually, efficient data structures are key to designing efficient algorithms. Some formal design methods and programming languages emphasize data structures, rather than algorithms, as the key organizing factor in software design. Storing and retrieving can be carried out on data stored in both main memory and in secondary memory.

Computational complexity theory

Main article: Computational complexity theory

Computational complexity theory is a branch of the theory of computation that focuses on classifying computational problems according to their inherent difficulty, and relating those classes to each other. A computational problem is understood to be a task that is in principle amenable to being solved by a computer, which is equivalent to stating that the problem may be solved by mechanical application of mathematical steps, such as an algorithm.

A problem is regarded as inherently difficult if its solution requires significant resources, whatever the algorithm used. The theory formalizes this intuition, by introducing mathematical models of computation to study these problems and quantifying the amount of resources needed to solve them, such as time and storage. Other complexity measures are also used, such as the amount of communication (used in communication complexity), the number of gates in a circuit (used in circuit complexity) and the number of processors (used in parallel computing). One of the roles of computational complexity theory is to determine the practical limits on what computers can and cannot do.

Distributed computation

Main article: Distributed computation

Distributed computing studies distributed systems. A distributed system is a software system in which components located on networked computers communicate and coordinate their actions by passing messages. The components interact with each other in order to achieve a common goal. Three significant characteristics of distributed systems are: concurrency of components, lack of a global clock, and independent failure of components. Examples of distributed systems vary from SOA-based systems to massively multiplayer online games to peer-to-peer applications.

A computer program that runs in a distributed system is called a distributed program, and distributed programming is the process of writing such programs. There are many alternatives for the message passing mechanism, including RPC-like connectors and message queues. An important goal and challenge of distributed systems is location transparency.

Parallel computation

Main article: Parallel computation

Parallel computing is a form of computation in which many calculations are carried out simultaneously, operating on the principle that large problems can often be divided into smaller ones, which are then solved concurrently ("in parallel"). There are several different forms of parallel computing: bit-level, instruction level, data, and task parallelism. Parallelism has been employed for many years, mainly in high-performance computing, but interest in it has grown lately due to the physical constraints preventing frequency scaling. As power consumption (and consequently heat generation) by computers has become a concern in recent years, parallel computing has become the dominant paradigm in computer architecture, mainly in the form of multi-core processors.

Parallel computer programs are more difficult to write than sequential ones, because concurrency introduces several new classes of potential software bugs, of which race conditions are the most common. Communication and synchronization between the different subtasks are typically some of the greatest obstacles to getting good parallel program performance.

The maximum possible speed-up of a single program as a result of parallelization is known as Amdahl's law.

Very-large-scale integration

Main article: VLSI

Very-large-scale integration (VLSI) is the process of creating an integrated circuit (IC) by combining thousands of transistors into a single chip. VLSI began in the 1970s when complex semiconductor and communication technologies were being developed. The microprocessor is a VLSI device. Before the introduction of VLSI technology most ICs had a limited set of functions they could perform. An electronic circuit might consist of a CPU, ROM, RAM and other glue logic. VLSI lets IC makers add all of these into one chip.

Machine learning

Main article: Machine learning

Machine learning is a scientific discipline that deals with the construction and study of algorithms that can learn from data. Such algorithms operate by building a model based on inputs and using that to make predictions or decisions, rather than following only explicitly programmed instructions.

Machine learning can be considered a subfield of computer science and statistics. It has strong ties to artificial intelligence and optimization, which deliver methods, theory and application domains to the field. Machine learning is employed in a range of computing tasks where designing and programming explicit, rule-based algorithms is infeasible. Example applications include spam filtering, optical character recognition (OCR), search engines and computer vision. Machine learning is sometimes conflated with data mining, although that focuses more on exploratory data analysis. Machine learning and pattern recognition "can be viewed as two facets of the same field."

Computational biology

Main article: Computational biology

Computational biology involves the development and application of data-analytical and theoretical methods, mathematical modeling and computational simulation techniques to the study of biological, behavioral, and social systems. The field is broadly defined and includes foundations in computer science, applied mathematics, animation, statistics, biochemistry, chemistry, biophysics, molecular biology, genetics, genomics, ecology, evolution, anatomy, neuroscience, and visualization.

Computational biology is different from biological computation, which is a subfield of computer science and computer engineering using bioengineering and biology to build computers, but is similar to bioinformatics, which is an interdisciplinary science using computers to store and process biological data.

Computational geometry

Main article: Computational geometry

Computational geometry is a branch of computer science devoted to the study of algorithms which can be stated in terms of geometry. Some purely geometrical problems arise out of the study of computational geometric algorithms, and such problems are also considered to be part of computational geometry. While modern computational geometry is a recent development, it is one of the oldest fields of computing with history stretching back to antiquity. An ancient precursor is the Sanskrit treatise Shulba Sutras , or "Rules of the Chord", that is a book of algorithms written in 800 BCE. The book prescribes step-by-step procedures for constructing geometric objects like altars using a peg and chord.

The main impetus for the development of computational geometry as a discipline was progress in computer graphics and computer-aided design and manufacturing (CAD/CAM), but many problems in computational geometry are classical in nature, and may come from mathematical visualization.

Other important applications of computational geometry include robotics (motion planning and visibility problems), geographic information systems (GIS) (geometrical location and search, route planning), integrated circuit design (IC geometry design and verification), computer-aided engineering (CAE) (mesh generation), computer vision (3D reconstruction).

Information theory

Main article: Information theory

Information theory is a branch of applied mathematics, electrical engineering, and computer science involving the quantification of information. Information theory was developed by Claude E. Shannon to find fundamental limits on signal processing operations such as compressing data and on reliably storing and communicating data. Since its inception it has broadened to find applications in many other areas, including statistical inference, natural language processing, cryptography, neurobiology, the evolution and function of molecular codes, model selection in ecology, thermal physics, quantum computing, linguistics, plagiarism detection, pattern recognition, anomaly detection and other forms of data analysis.

Applications of fundamental topics of information theory include lossless data compression (e.g. ZIP files), lossy data compression (e.g. MP3s and JPEGs), and channel coding (e.g. for Digital Subscriber Line (DSL)). The field is at the intersection of mathematics, statistics, computer science, physics, neurobiology, and electrical engineering. Its impact has been crucial to the success of the Voyager missions to deep space, the invention of the compact disc, the feasibility of mobile phones, the development of the Internet, the study of linguistics and of human perception, the understanding of black holes, and numerous other fields. Important sub-fields of information theory are source coding, channel coding, algorithmic complexity theory, algorithmic information theory, information-theoretic security, and measures of information.

Cryptography

Main article: Cryptography

Cryptography is the practice and study of techniques for secure communication in the presence of third parties (called adversaries). More generally, it is about constructing and analyzing protocols that overcome the influence of adversaries and that are related to various aspects in information security such as data confidentiality, data integrity, authentication, and non-repudiation. Modern cryptography intersects the disciplines of mathematics, computer science, and electrical engineering. Applications of cryptography include ATM cards, computer passwords, and electronic commerce.

Modern cryptography is heavily based on mathematical theory and computer science practice; cryptographic algorithms are designed around computational hardness assumptions, making such algorithms hard to break in practice by any adversary. It is theoretically possible to break such a system, but it is infeasible to do so by any known practical means. These schemes are therefore termed computationally secure; theoretical advances, e.g., improvements in integer factorization algorithms, and faster computing technology require these solutions to be continually adapted. There exist information-theoretically secure schemes that provably cannot be broken even with unlimited computing power—an example is the one-time pad—but these schemes are more difficult to implement than the best theoretically breakable but computationally secure mechanisms.

Quantum computation

Main article: Quantum computation

A quantum computer is a computation system that makes direct use of quantum-mechanical phenomena, such as superposition and entanglement, to perform operations on data. Quantum computers are different from digital computers based on transistors. Whereas digital computers require data to be encoded into binary digits (bits), each of which is always in one of two definite states (0 or 1), quantum computation uses qubits (quantum bits), which can be in superpositions of states. A theoretical model is the quantum Turing machine, also known as the universal quantum computer. Quantum computers share theoretical similarities with non-deterministic and probabilistic computers; one example is the ability to be in more than one state simultaneously. The field of quantum computing was first introduced by Yuri Manin in 1980 and Richard Feynman in 1982. A quantum computer with spins as quantum bits was also formulated for use as a quantum space–time in 1968.

As of 2014, quantum computing is still in its infancy but experiments have been carried out in which quantum computational operations were executed on a very small number of qubits. Both practical and theoretical research continues, and many national governments and military funding agencies support quantum computing research to develop quantum computers for both civilian and national security purposes, such as cryptanalysis.

Computational number theory

Main article: Computational number theory

Computational number theory, also known as algorithmic number theory, is the study of algorithms for performing number theoretic computations. The best known problem in the field is integer factorization.

Symbolic computation

Main article: Symbolic computation

Computer algebra, also called symbolic computation or algebraic computation is a scientific area that refers to the study and development of algorithms and software for manipulating mathematical expressions and other mathematical objects. Although, properly speaking, computer algebra should be a subfield of scientific computing, they are generally considered as distinct fields because scientific computing is usually based on numerical computation with approximate floating point numbers, while symbolic computation emphasizes exact computation with expressions containing variables that have not any given value and are thus manipulated as symbols (therefore the name of symbolic computation).

Software applications that perform symbolic calculations are called computer algebra systems, with the term system alluding to the complexity of the main applications that include, at least, a method to represent mathematical data in a computer, a user programming language (usually different from the language used for the implementation), a dedicated memory manager, a user interface for the input/output of mathematical expressions, a large set of routines to perform usual operations, like simplification of expressions, differentiation using chain rule, polynomial factorization, indefinite integration, etc.

Program semantics

Main article: Program semantics

In programming language theory, semantics is the field concerned with the rigorous mathematical study of the meaning of programming languages. It does so by evaluating the meaning of syntactically legal strings defined by a specific programming language, showing the computation involved. In such a case that the evaluation would be of syntactically illegal strings, the result would be non-computation. Semantics describes the processes a computer follows when executing a program in that specific language. This can be shown by describing the relationship between the input and output of a program, or an explanation of how the program will execute on a certain platform, hence creating a model of computation.

Formal methods

Main article: Formal methods

Formal methods are a particular kind of mathematically based techniques for the specification, development and verification of software and hardware systems. The use of formal methods for software and hardware design is motivated by the expectation that, as in other engineering disciplines, performing appropriate mathematical analysis can contribute to the reliability and robustness of a design.

Formal methods are best described as the application of a fairly broad variety of theoretical computer science fundamentals, in particular logic calculi, formal languages, automata theory, and program semantics, but also type systems and algebraic data types to problems in software and hardware specification and verification.

Automata theory

Main article: Automata theory

Automata theory is the study of abstract machines and automata, as well as the computational problems that can be solved using them. It is a theory in theoretical computer science, under Discrete mathematics (a section of Mathematics and also of Computer Science). Automata comes from the Greek word αὐτόματα meaning "self-acting".

Automata Theory is the study of self-operating virtual machines to help in logical understanding of input and output process, without or with intermediate stage(s) of computation (or any function / process).

Coding theory

Main article: Coding theory

Coding theory is the study of the properties of codes and their fitness for a specific application. Codes are used for data compression, cryptography, error-correction and more recently also for network coding. Codes are studied by various scientific disciplines—such as information theory, electrical engineering, mathematics, and computer science—for the purpose of designing efficient and reliable data transmission methods. This typically involves the removal of redundancy and the correction (or detection) of errors in the transmitted data.

Computational learning theory

Main article: Computational learning theory

Theoretical results in machine learning mainly deal with a type of inductive learning called supervised learning. In supervised learning, an algorithm is given samples that are labeled in some useful way. For example, the samples might be descriptions of mushrooms, and the labels could be whether or not the mushrooms are edible. The algorithm takes these previously labeled samples and uses them to induce a classifier. This classifier is a function that assigns labels to samples including the samples that have never been previously seen by the algorithm. The goal of the supervised learning algorithm is to optimize some measure of performance such as minimizing the number of mistakes made on new samples.

Organizations

Journals and newsletters

Conferences

See also

Notes

  1. "SIGACT". Retrieved 2009-03-29.
  2. "ToCT". Retrieved 2010-06-09.
  3. "Challenges for Theoretical Computer Science: Theory as the Scientific Foundation of Computing". Retrieved 2009-03-29.
  4. "Any classical mathematical algorithm, for example, can be described in a finite number of English words" (Rogers 1987:2).
  5. Well defined with respect to the agent that executes the algorithm: "There is a computing agent, usually human, which can react to the instructions and carry out the computations" (Rogers 1987:2).
  6. "an algorithm is a procedure for computing a function (with respect to some chosen notation for integers) ... this limitation (to numerical functions) results in no loss of generality", (Rogers 1987:1).
  7. "An algorithm has zero or more inputs, i.e., quantities which are given to it initially before the algorithm begins" (Knuth 1973:5).
  8. "A procedure which has all the characteristics of an algorithm except that it possibly lacks finiteness may be called a 'computational method'" (Knuth 1973:5).
  9. "An algorithm has one or more outputs, i.e. quantities which have a specified relation to the inputs" (Knuth 1973:5).
  10. Whether or not a process with random interior processes (not including the input) is an algorithm is debatable. Rogers opines that: "a computation is carried out in a discrete stepwise fashion, without use of continuous methods or analogue devices . . . carried forward deterministically, without resort to random methods or devices, e.g., dice" Rogers 1987:2.
  11. Paul E. Black (ed.), entry for data structure in Dictionary of Algorithms and Data Structures. U.S. National Institute of Standards and Technology. 15 December 2004. Online version Accessed May 21, 2009.
  12. Entry data structure in the Encyclopædia Britannica (2009) Online entry accessed on May 21, 2009.
  13. ^ Coulouris, George; Jean Dollimore; Tim Kindberg; Gordon Blair (2011). Distributed Systems: Concepts and Design (5th Edition). Boston: Addison-Wesley. ISBN 0-132-14301-1.
  14. Andrews (2000) harvtxt error: no target: CITEREFAndrews2000 (help). Dolev (2000) harvtxt error: no target: CITEREFDolev2000 (help). Ghosh (2007) harvtxt error: no target: CITEREFGhosh2007 (help), p. 10.
  15. Gottlieb, Allan; Almasi, George S. (1989). Highly parallel computing. Redwood City, Calif.: Benjamin/Cummings. ISBN 0-8053-0177-1.
  16. S.V. Adve et al. (November 2008). "Parallel Computing Research at Illinois: The UPCRC Agenda" (PDF). Parallel@Illinois, University of Illinois at Urbana-Champaign. "The main techniques for these performance benefits – increased clock frequency and smarter but increasingly complex architectures – are now hitting the so-called power wall. The computer industry has accepted that future performance increases must largely come from increasing the number of processors (or cores) on a die, rather than making a single core go faster."
  17. Asanovic et al. Old : Power is free, but transistors are expensive. New is power is expensive, but transistors are "free".
  18. Asanovic, Krste et al. (December 18, 2006). "The Landscape of Parallel Computing Research: A View from Berkeley" (PDF). University of California, Berkeley. Technical Report No. UCB/EECS-2006-183. "Old : Increasing clock frequency is the primary method of improving processor performance. New : Increasing parallelism is the primary method of improving processor performance ... Even representatives from Intel, a company generally associated with the 'higher clock-speed is better' position, warned that traditional approaches to maximizing performance through maximizing clock speed have been pushed to their limit."
  19. Hennessy, John L.; Patterson, David A.; Larus, James R. (1999). Computer organization and design : the hardware/software interface (2. ed., 3rd print. ed.). San Francisco: Kaufmann. ISBN 1-55860-428-6.
  20. Ron Kovahi; Foster Provost (1998). "Glossary of terms". Machine Learning. 30: 271–274.
  21. ^ C. M. Bishop (2006). Pattern Recognition and Machine Learning. Springer. ISBN 0-387-31073-8.
  22. Wernick, Yang, Brankov, Yourganov and Strother, Machine Learning in Medical Imaging, IEEE Signal Processing Magazine, vol. 27, no. 4, July 2010, pp. 25-38
  23. Mannila, Heikki (1996). Data mining: machine learning, statistics, and databases. Int'l Conf. Scientific and Statistical Database Management. IEEE Computer Society.
  24. Friedman, Jerome H. (1998). "Data Mining and Statistics: What's the connection?". Computing Science and Statistics. 29 (1): 3–9.
  25. "NIH working definition of bioinformatics and computational biology" (PDF). Biomedical Information Science and Technology Initiative. 17 July 2000. Retrieved 18 August 2012.
  26. "About the CCMB". Center for Computational Molecular Biology. Retrieved 18 August 2012.
  27. F. Rieke, D. Warland, R Ruyter van Steveninck, W Bialek (1997). Spikes: Exploring the Neural Code. The MIT press. ISBN 978-0262681087.{{cite book}}: CS1 maint: multiple names: authors list (link)
  28. cf. Huelsenbeck, J. P., F. Ronquist, R. Nielsen and J. P. Bollback (2001) Bayesian inference of phylogeny and its impact on evolutionary biology, Science 294:2310-2314
  29. Rando Allikmets, Wyeth W. Wasserman, Amy Hutchinson, Philip Smallwood, Jeremy Nathans, Peter K. Rogan, Thomas D. Schneider, Michael Dean (1998) Organization of the ABCR gene: analysis of promoter and splice junction sequences, Gene 215:1, 111-122
  30. Burnham, K. P. and Anderson D. R. (2002) Model Selection and Multimodel Inference: A Practical Information-Theoretic Approach, Second Edition (Springer Science, New York) ISBN 978-0-387-95364-9.
  31. Jaynes, E. T. (1957) Information Theory and Statistical Mechanics, Phys. Rev. 106:620
  32. Charles H. Bennett, Ming Li, and Bin Ma (2003) Chain Letters and Evolutionary Histories, Scientific American 288:6, 76-81
  33. David R. Anderson (November 1, 2003). "Some background on why people in the empirical sciences may want to better understand the information-theoretic methods" (pdf). Retrieved 2010-06-23.
  34. Rivest, Ronald L. (1990). "Cryptology". In J. Van Leeuwen (ed.). Handbook of Theoretical Computer Science. Vol. 1. Elsevier.
  35. Bellare, Mihir; Rogaway, Phillip (21 September 2005). "Introduction". Introduction to Modern Cryptography. p. 10.
  36. Cite error: The named reference hac was invoked but never defined (see the help page).
  37. "Quantum Computing with Molecules" article in Scientific American by Neil Gershenfeld and Isaac L. Chuang
  38. Manin, Yu. I. (1980). Vychislimoe i nevychislimoe (in Russian). Sov.Radio. pp. 13–15. Retrieved 4 March 2013. {{cite book}}: Unknown parameter |trans_title= ignored (|trans-title= suggested) (help)
  39. Feynman, R. P. (1982). "Simulating physics with computers". International Journal of Theoretical Physics. 21 (6): 467–488. doi:10.1007/BF02650179.
  40. Deutsch, David (1992-01-06). "Quantum computation". Physics World.
  41. Finkelstein, David (1968). "Space-Time Structure in High Energy Interactions". In Gudehus, T.; Kaiser, G. (eds.). Fundamental Interactions at High Energy. New York: Gordon & Breach.
  42. "New qubit control bodes well for future of quantum computing". Retrieved 26 October 2014.
  43. Quantum Information Science and Technology Roadmap for a sense of where the research is heading.
  44. R. W. Butler (2001-08-06). "What is Formal Methods?". Retrieved 2006-11-16.
  45. C. Michael Holloway. "Why Engineers Should Consider Formal Methods" (PDF). 16th Digital Avionics Systems Conference (27–30 October 1997). Retrieved 2006-11-16. {{cite journal}}: Cite journal requires |journal= (help)
  46. Monin, pp.3-4
  47. ^ The 2007 Australian Ranking of ICT Conferences: tier A+.
  48. ^ The 2007 Australian Ranking of ICT Conferences: tier A.
  49. FCT 2011 (retrieved 2013-06-03)

Further reading

External links

Computer science
Note: This template roughly follows the 2012 ACM Computing Classification System.
Hardware
Computer systems organization
Networks
Software organization
Software notations and tools
Software development
Theory of computation
Algorithms
Mathematics of computing
Information systems
Security
Human–computer interaction
Concurrency
Artificial intelligence
Machine learning
Graphics
Applied computing
Categories:
Theoretical computer science: Difference between revisions Add topic