1. Programming a simple compiler

I am writing a compiler for a simple language.

I have made a lexer/tokenizer that prints the tokens to stdout.

Now I want to do syntax analysis, but I don't know how to modify my lexer so that the parser can take the tokens as input.

  • A linked list of tokens is extremely inefficient for large files (source files of around 80MB take about 1.3GB of RAM).
  • I could modify my lexer to return the next token each time it is called (an idea taken from the Dragon Book), but I don't know what I will do if, somewhere in the process, I have to go back and read a previous token.
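The second idea, a streaming lexer, is what avoids the memory problem: instead of building up a list of all tokens, the lexer produces one token on demand, so memory use stays constant regardless of source size. A minimal sketch (the token set and names here are illustrative, not from the question's language):

```python
import re

# Hypothetical toy token set: numbers, identifiers, and a few operators.
TOKEN_RE = re.compile(
    r"(?P<NUMBER>\d+)|(?P<IDENT>[A-Za-z_]\w*)|(?P<OP>[+\-*/=()])"
)

def tokens(source):
    """Yield (kind, text) pairs one at a time instead of storing them all."""
    pos = 0
    n = len(source)
    while pos < n:
        if source[pos].isspace():
            pos += 1
            continue
        m = TOKEN_RE.match(source, pos)
        if not m:
            raise SyntaxError(f"unexpected character at position {pos}")
        pos = m.end()
        yield (m.lastgroup, m.group(m.lastgroup))
```

A parser then consumes the generator directly, e.g. `for kind, text in tokens(src): ...`, and only the current token is in memory at any moment.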

What is the right way to do this?

Answers

Implementing a nextToken() method in the lexical analyser is the standard approach. The parser (or syntax analyser) calls this method until the entire input has been consumed.

but I don't know what I will do if somewhere in the process I have to go back and read a previous token

This is not usually the case. But what the parser may need to do is push back a token (or a number of tokens, depending on the parser's lookahead) which has already been seen. In this case the lexer provides a pushBack(Token) method which ensures that the next call to nextToken() will return the supplied token, rather than the next token appearing in the input.
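The nextToken()/pushBack() interface described above can be sketched as a thin wrapper around a token stream; the pushed-back tokens live on a small stack that is drained before any new input is read (the class and method names here are illustrative):

```python
class Lexer:
    """Sketch of a lexer with next_token()/push_back(), as described above."""

    def __init__(self, token_iter):
        self._tokens = iter(token_iter)
        self._pushed = []  # stack of pushed-back tokens

    def next_token(self):
        # Pushed-back tokens are returned before reading further input.
        if self._pushed:
            return self._pushed.pop()
        return next(self._tokens, None)  # None signals end of input

    def push_back(self, token):
        # Guarantees that the next call to next_token() returns `token`.
        self._pushed.append(token)
```

With k tokens of lookahead, the parser simply pushes back up to k tokens in reverse order of how it read them.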

but I don't know what I will do if somewhere in the process I have to go back and read a previous token

It sounds like your matches are too greedy.

You might look into backtracking.
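One common way to implement backtracking over a token stream is to buffer the tokens seen so far and let the parser mark a position and rewind to it when a grammar alternative fails. A minimal sketch, with illustrative names (this is one possible design, not the only one):

```python
class TokenStream:
    """Backtracking by position: tokens already read are kept in a buffer so
    the parser can mark() a spot and reset() to it if a parse attempt fails."""

    def __init__(self, token_iter):
        self._source = iter(token_iter)
        self._buffer = []  # every token read from the source so far
        self._pos = 0      # current read position within the buffer

    def next_token(self):
        if self._pos == len(self._buffer):
            tok = next(self._source, None)
            if tok is None:
                return None  # end of input
            self._buffer.append(tok)
        tok = self._buffer[self._pos]
        self._pos += 1
        return tok

    def mark(self):
        return self._pos

    def reset(self, mark):
        self._pos = mark
```

A recursive-descent parser would call `m = stream.mark()` before trying an alternative and `stream.reset(m)` if that alternative fails, then try the next one.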
