Research Projects for PhD Students

Detlef Plump
Programming Languages and Systems Group
Department of Computer Science
University of York

The following PhD projects offer opportunities for research ranging from theoretical work to implementation work. The projects are examples, I am happy to discuss other ideas for PhD topics.

Graph Programs for Graph Algorithms

GP 2 is an experimental rule-based language for problem solving in the domain of graphs [Plump 2012, Plump 2009]. The GP 2 environment consists of a compiler which generates C code [Bak 2015] and a graphical editor for programs and graphs. In addition, there exists a reference interpreter [Bak et al. 2015].

The purpose of this project is to demonstrate that rooted GP 2 programs can match the complexity of graph algorithms in conventional programming languages. Moreover, a methodology for refining unrooted programs to efficient rooted programs should be developed. Rooted programs allow constant-time access to distinguished nodes in the host graph. For example, [Bak 2015] uses a rooted program to check in linear time whether input graphs (of bounded node degree) are 2-colourable. Timing experiments demonstrate that on grid graphs of up to 100,000 nodes, the GP 2 program matches the run time of Sedgewick's tailor-made C program for 2-colourability [Sedgewick 2002]. However, on graphs of unbounded degree, such as star graphs, the program runs in quadratic time.

This project will conduct a range of case studies on graph algorithms in conventional languages, as found in textbooks such as [Sedgewick 2002, Cormen et al. 2014, Skiena 2011], and attempt to design equivalent GP 2 programs of comparable complexity. The run times of these programs should be experimentally compared with published C programs for the algorithms in question. To obtain fast run times on graphs of unbounded degree, the GP 2 compiler should be improved by avoiding redundant search after backtracking.

By evaluating the case studies, it may be possible to develop a methodology for refining unrooted programs to efficient rooted programs. This is desirable because unrooted programs, which may be highly non-deterministic, are often more concise and easier to design than rooted programs.

References

D. Plump (2012), The Design of GP 2. In Proc. Reduction Strategies in Rewriting and Programming (WRS 2011), Electronic Proceedings in Theoretical Computer Science 82, pages 1-16
D. Plump (2009), The Graph Programming Language GP. In Proc. Algebraic Informatics (CAI 2009), volume 5725 of Lecture Notes in Computer Science, pages 99-122, Springer
C. Bak, G. Faulkner, D. Plump and C. Runciman (2015), A Reference Interpreter for the Graph Programming Language GP 2. In Proc. Graphs as Models (GaM 2015), Electronic Proceedings in Theoretical Computer Science 181, pages 48-64
C. Bak (2015), GP 2: Efficient Implementation of a Graph Programming Language. PhD thesis, University of York
R. Sedgewick (2014), Algorithms in C - Part 5: Graph Algorithms. 3rd edition, Addison-Wesley
T.H. Cormen, C.E. Leiserson, R.L. Rivest and C. Stein (2014), Introduction to Algorithms. 3rd edition, MIT Press
S.S. Skiena (2011), The Algorithm Design Manual. 2nd edition, Springer

Random Graph Generation with Graph Grammars

To test programs (in any language) working on graphs, it is highly desirable to generate graphs randomly within a specified space of graphs that is relevant for a given application. Available graph generators lack the latter feature. On the other hand, random string generators exist that generate strings uniformly at random within a user-defined context-free language. The goal of this project is to develop algorithms that generate graphs uniformly at random within a user-defined graph class. The graph class may be defined by a context-free graph grammar [Drewes et al. 1999], a graph reduction system or even by a graph program [Plump 2009].

In the undergraduate project [Coxon 2013], a tool has been implemented which produces random graphs over context-free graph grammars. The user specifies a hyperedge-replacement grammar [Drewes et al. 1999] and a size range, and the tool generates a sequence of random graphs from the language of that grammar. The hallmark of this generator is that users can specify the shape of the graphs to be generated by means of the graph grammar. For example, one may want to generate trees or flow graphs only.

A limitation of this approach is that it requires an unambiguous grammar as input in order to generate graphs with a uniform distribution. However, many practical examples of graph grammars are ambiguous and in this case many isomorphic copies of already generated graphs may be produced. In contrast, [Santini 2010, Bertoni et al. 2000] generate strings uniformly at random over possibly ambiguous context-free string grammars. This approach should be adapted to context-free graph grammars and possibly to other graph generating formalisms.

Moreover, it would be desirable to generate graphs randomly with non-uniform distributions which may be more appropriate for certain applications.

References

F. Drewes, A. Habel, H.-J. Kreowski (1999), Hyperedge Replacement Graph Grammars. In Handbook of Graph Grammars and Computing by Graph Transformation, Volume 1: Foundations, World-Scientific
D. Plump (2009), The Graph Programming Language GP. In Proc. Algebraic Informatics (CAI 2009), volume 5725 of Lecture Notes in Computer Science, pages 99-122, Springer
J. Coxon (2013), Uniform Random Generation of Graphs with Graph Grammars, BEng dissertation, University of York
M. Santini (2010), Random Generation and Approximate Counting of Combinatorial Structures, CoRR abs/1012.3000
A. Bertoni, M. Goldwurm, M. Santini (2000), Random Generation and Approximate Counting of Ambiguously Described Combinatorial Structures, in Proc. Theoretical Aspects of Computer Science (STACS 2000), volume 1770 of Lecture Notes in Computer Science, pages 567-580, Springer