Project Name: Compiler Project - Phases 1 & 2

Overview

This repository contains the source code and documentation for a team project implementing the first two phases of a compiler. Four team members collaborated to build a compiler front end that performs lexical analysis (Phase 1) and syntax analysis (Phase 2).

Project Phases

Phase 1: Lexical Analysis

In Phase 1, our primary focus was lexical analysis: we designed and implemented algorithms that read a set of lexical rules and recognize the corresponding tokens in the input. This involved the following steps:

Lexical Rule Specification: We defined lexical rules to describe the syntax of tokens in the input programming language.
Nondeterministic Finite Automaton (NFA) Construction: Based on the lexical rules, we constructed NFAs to represent the token recognition process.
Deterministic Finite Automaton (DFA) Construction and Minimization: We converted the NFAs into an equivalent DFA and minimized it to optimize the token recognition process.
Token Detection: Utilizing the DFA, we implemented token detection logic to identify tokens from the input source code.
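
The NFA-to-DFA step above can be illustrated with the standard subset construction. The following is a minimal sketch, not the code in this repository: the `Nfa` and `Dfa` structs, the `kEpsilon` marker, and the function names are hypothetical and chosen only for this example. The sample NFA recognizes `(a|b)*abb` and includes one epsilon transition so that the closure step does real work.

```cpp
#include <cassert>
#include <map>
#include <queue>
#include <set>
#include <string>
#include <utility>
#include <vector>

// Hypothetical representations for illustration only (not the project's NFA.h).
// States are non-negative ints; '\0' labels an epsilon transition.
constexpr char kEpsilon = '\0';

struct Nfa {
    int start = 0;
    std::set<int> accepting;
    std::map<std::pair<int, char>, std::set<int>> delta;
};

struct Dfa {
    int start = 0;
    std::set<int> accepting;
    std::map<std::pair<int, char>, int> delta;
};

// All NFA states reachable from `states` using only epsilon transitions.
std::set<int> epsilonClosure(const Nfa& nfa, std::set<int> states) {
    std::vector<int> work(states.begin(), states.end());
    while (!work.empty()) {
        int s = work.back();
        work.pop_back();
        auto it = nfa.delta.find({s, kEpsilon});
        if (it == nfa.delta.end()) continue;
        for (int t : it->second)
            if (states.insert(t).second) work.push_back(t);
    }
    return states;
}

// Subset construction: every DFA state stands for a set of NFA states.
Dfa subsetConstruction(const Nfa& nfa, const std::string& alphabet) {
    assert(nfa.start >= 0);
    Dfa dfa;
    std::map<std::set<int>, int> ids;  // NFA state set -> DFA state id
    std::queue<std::set<int>> pending;
    std::set<int> startSet = epsilonClosure(nfa, {nfa.start});
    ids[startSet] = 0;
    pending.push(startSet);
    while (!pending.empty()) {
        std::set<int> current = pending.front();
        pending.pop();
        int id = ids[current];
        for (int s : current)
            if (nfa.accepting.count(s)) dfa.accepting.insert(id);
        for (char c : alphabet) {
            std::set<int> moved;
            for (int s : current) {
                auto it = nfa.delta.find({s, c});
                if (it != nfa.delta.end())
                    moved.insert(it->second.begin(), it->second.end());
            }
            if (moved.empty()) continue;  // no transition on c: dead end
            std::set<int> target = epsilonClosure(nfa, moved);
            auto result = ids.emplace(target, static_cast<int>(ids.size()));
            if (result.second) pending.push(target);  // newly discovered state
            dfa.delta[{id, c}] = result.first->second;
        }
    }
    return dfa;
}

// Runs the DFA over the whole input and reports whether it ends in an accepting state.
bool accepts(const Dfa& dfa, const std::string& input) {
    int state = dfa.start;
    for (char c : input) {
        auto it = dfa.delta.find({state, c});
        if (it == dfa.delta.end()) return false;
        state = it->second;
    }
    return dfa.accepting.count(state) > 0;
}

// Sample NFA for (a|b)*abb with one epsilon edge from state 0 to state 1.
Nfa exampleNfa() {
    Nfa nfa;
    nfa.start = 0;
    nfa.accepting = {4};
    nfa.delta[{0, kEpsilon}] = {1};
    nfa.delta[{1, 'a'}] = {1, 2};
    nfa.delta[{1, 'b'}] = {1};
    nfa.delta[{2, 'b'}] = {3};
    nfa.delta[{3, 'b'}] = {4};
    return nfa;
}
```

Calling `subsetConstruction(exampleNfa(), "ab")` and passing the result to `accepts` accepts `abb` and `babb` and rejects `ab`. The resulting DFA is not yet minimal: a separate minimization pass would merge equivalent states, which this sketch leaves out to stay short.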

Phase 2: Syntax Analysis

Phase 2 of the project involved syntax analysis, where we focused on parsing the input code based on grammar rules. Our approach included:

Grammar Rule Examination: We read the grammar rules of the programming language to understand its syntactic structure.
LL(1) Grammar Transformation: To make the grammar suitable for predictive parsing, we transformed it into LL(1) form by applying left factoring and eliminating left recursion (both immediate and non-immediate).
First and Follow Set Calculation: We computed the first and follow sets of each non-terminal symbol in the grammar.
Parse Table Generation: Using the computed first and follow sets, we constructed the predictive parse table that drives the parser.
Parsing Algorithm Implementation: With the parse table in place, we implemented the parsing algorithm, which scans the input code and produces a stack of actions for each token. This sequence of actions also allows us to build the parse tree.
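
The table-driven parsing step above can be sketched as follows. This is an illustrative example under stated assumptions rather than the project's parser: the grammar (`E -> T E'`, `E' -> + T E' | epsilon`, `T -> id | ( E )`), the hand-written table in `exampleTable`, and all type and function names are hypothetical. In the real project the table is generated from the first and follow sets instead of being hard-coded.

```cpp
#include <cstddef>
#include <map>
#include <string>
#include <utility>
#include <vector>

using Symbol = std::string;
using Production = std::vector<Symbol>;  // an empty vector stands for epsilon
// Key: (non-terminal on top of the stack, lookahead token).
using ParseTable = std::map<std::pair<Symbol, Symbol>, Production>;

struct ParseResult {
    bool accepted = false;
    std::vector<std::string> actions;  // one entry per parser step
};

// Hypothetical LL(1) table for the small expression grammar named above.
ParseTable exampleTable() {
    ParseTable t;
    t[{"E", "id"}] = Production{"T", "E'"};
    t[{"E", "("}] = Production{"T", "E'"};
    t[{"E'", "+"}] = Production{"+", "T", "E'"};
    t[{"E'", ")"}] = Production{};
    t[{"E'", "$"}] = Production{};
    t[{"T", "id"}] = Production{"id"};
    t[{"T", "("}] = Production{"(", "E", ")"};
    return t;
}

// Joins the symbols of a production body with single spaces.
std::string join(const Production& rhs) {
    std::string out;
    for (std::size_t i = 0; i < rhs.size(); ++i) {
        if (i > 0) out += " ";
        out += rhs[i];
    }
    return out;
}

// Predictive parsing loop: match terminals, expand non-terminals via the table.
ParseResult parse(const ParseTable& table, const Symbol& start,
                  std::vector<Symbol> tokens) {
    ParseResult result;
    tokens.push_back("$");  // end-of-input marker
    std::vector<Symbol> stack = {"$", start};
    std::size_t pos = 0;
    while (!stack.empty()) {
        Symbol top = stack.back();
        const Symbol& lookahead = tokens[pos];
        if (top == lookahead) {
            stack.pop_back();
            if (top == "$") {
                result.actions.push_back("accept");
                result.accepted = true;
                return result;
            }
            result.actions.push_back("match " + top);
            ++pos;
            continue;
        }
        auto entry = table.find({top, lookahead});
        if (entry == table.end()) {  // empty table cell: syntax error
            result.actions.push_back("error at " + lookahead);
            return result;
        }
        stack.pop_back();
        const Production& rhs = entry->second;
        // Push the body in reverse so its first symbol ends up on top.
        for (auto it = rhs.rbegin(); it != rhs.rend(); ++it) stack.push_back(*it);
        result.actions.push_back(top + " -> " + (rhs.empty() ? "epsilon" : join(rhs)));
    }
    return result;
}
```

For the token sequence `id + id`, `parse(exampleTable(), "E", {"id", "+", "id"})` accepts the input and records each expansion and match in `actions`. A list like this is the kind of step-by-step trace that could be written out one row per step, and the recorded expansions are enough to rebuild the parse tree top-down.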

Usage

The input code must follow certain rules and constraints, which are to be added to the repository.
A test input is included in the repository.
The simulation stack is saved to a CSV file in the repository's root directory.

Dependencies

Programming Language: C++


Repository: AliELSharawy/Compiler
