Brandon Birmingham is a Computer Scientist who recently completed his PhD research in Vision and Language at the University of Malta. His work contributed to Spatial Relation Detection in images and introduced novel models for automatic Image Caption Generation.
He graduated with a BSc (Hons) and MSc in Computer Science from the University of Malta in 2015 and 2016, respectively. In parallel with his academic studies and duties, he worked in the industry as a Software and Business Intelligence Developer.
He is passionate about creative problem solving and engineering innovative technological solutions. Brandon is also intrigued by psychology and neuroscience.
Download CV.
PhD in Artificial Intelligence, 2022
University of Malta
MSc in Computer Science, 2016
University of Malta
BSc (Hons) in Computing Science, 2015
University of Malta
Python, Java, C#, C, C++, Assembly, Web development
TensorFlow, PyTorch, pandas, NumPy, scikit-learn, NLTK, Matplotlib
SQL, SSIS, SSRS, SSAS
Detecting spatial relationships between objects depicted in an image is an important sub-task in vision and language understanding. Its practical use lies in visual discourse, when referring to objects by their relationship in the context of others, and it finds application in higher-level tasks such as visual question answering and image description generation. One might presume that the selection of spatial prepositions grounded in an image is straightforward. In general, however, human beings do not always agree, or are not consistent, when choosing spatial prepositions. This can be due to various reasons, such as near-synonyms, overlapping terms and different frames of reference. For these reasons, the automatic detection of spatial relations is a non-trivial multi-label problem. This paper addresses the automatic multi-selection of prepositions. The study is based on the development of a number of machine learning models, namely Nearest Neighbor (NN), k-Means Clustering (kM-C), Agglomerative Hierarchical Clustering (A-HC) and a Multi-label Neural Network (ML-NN). Model performance is compared quantitatively using multi-label metrics, as well as human evaluations that are independent of the ground-truth labels. Additionally, the classification results are used as the basis for an error and qualitative analysis that sheds light on how each model deals with synonymous and overlapping relations, and that groups common errors to inform future directions. Furthermore, to gain insight into the merits of multi-label models, a single-label Random Forest (RF) classifier is developed and its results are included in the analysis. Of all the multi-label models, the ML-NN exhibits the best overall performance when evaluated on both the dataset ground truth and the independent human evaluations. It suffers, however, from under-generating prepositions, while the rest of the models often generate more prepositions at the expense of precision.
The clustering-based methods are also not entirely consistent, although they outperform the other models on less frequent spatial configurations. The results from the single-label RF classifier highlight the usefulness of a multi-label model. Finally, the error analysis indicates that the majority of errors are due to the lack of features that give cues about object position and orientation (object pose), the fixed frame of reference, and the failure to resolve depth in perspective views.
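As a rough illustration of the multi-label setting described above, the sketch below trains a small multi-label neural network on toy features and scores it with micro-averaged F1, one of the standard multi-label metrics. The feature values and the three "preposition" labels are placeholders, not the paper's actual data or feature set.

```python
import numpy as np
from sklearn.neural_network import MLPClassifier
from sklearn.metrics import f1_score

rng = np.random.default_rng(0)

# Toy stand-in for geometric object-pair features (e.g. relative
# position and overlap); purely illustrative.
X = rng.random((200, 6))
# Multi-label targets: each column stands for one spatial preposition
# (e.g. "above", "on", "near"); an object pair may carry several at once.
Y = (X[:, :3] > 0.5).astype(int)

# scikit-learn's MLPClassifier handles multi-label targets natively
# when Y is a binary indicator matrix.
clf = MLPClassifier(hidden_layer_sizes=(32,), max_iter=500, random_state=0)
clf.fit(X[:150], Y[:150])

pred = clf.predict(X[150:])
score = f1_score(Y[150:], pred, average="micro")
print("micro-F1:", round(score, 3))
```

Unlike a single-label classifier, the network can emit several prepositions per object pair, which is exactly what inconsistent human labelling calls for.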
Detection of spatial relations between objects in images is currently a popular subject in image description research. A range of different language and geometric object features have been used in this context, but methods so far have not used explicit information about the third dimension (depth), except when it is manually added to annotations. The lack of such information hampers the detection of spatial relations that are inherently 3D. In this paper, we use a fully automatic method for creating a depth map of an image and derive several different object-level depth features from it, which we add to an existing feature set to test the effect on spatial relation detection. We show that adding depth features improves performance in all scenarios tested.
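As a loose sketch of what object-level depth features might look like, the snippet below derives simple depth statistics for an object's bounding box from a depth map. The helper function, the toy depth map and the feature names are illustrative assumptions, not the exact feature set used in the paper.

```python
import numpy as np

def object_depth_features(depth_map, box):
    """Object-level depth statistics from a per-pixel depth map.

    `box` is (x0, y0, x1, y1) in pixels. The statistics are
    illustrative examples of object-level depth features.
    """
    x0, y0, x1, y1 = box
    region = depth_map[y0:y1, x0:x1]
    return {
        "mean_depth": float(region.mean()),
        "min_depth": float(region.min()),
        "max_depth": float(region.max()),
        "depth_range": float(region.max() - region.min()),
    }

# Toy depth map: values increase with distance from the camera.
depth = np.linspace(0.0, 10.0, 100 * 100).reshape(100, 100)
feats = object_depth_features(depth, (10, 10, 30, 30))

# A pairwise cue could then compare two objects' mean depths:
# the sign of the difference says which object is closer.
```

Features like these can be concatenated onto an existing 2D geometric feature vector, letting a classifier pick up relations such as "in front of" or "behind" that 2D geometry alone cannot resolve.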