public inbox for gcc-help@gcc.gnu.org
 help / color / mirror / Atom feed
* GCC Plugin to insert new expressions/statements in the code
@ 2020-07-15  7:09 Masoud Gholami
  0 siblings, 0 replies; only message in thread
From: Masoud Gholami @ 2020-07-15  7:09 UTC (permalink / raw)
  To: gcc-help

Hi,

I am using GCC 9.3 and writing a plugin that  uses the PLUGIN_PRAGMAS event to register a custom pragma that is expected to be before a function call as follows:

int main() {

	char *filename = “path/to/file”;
	#pragma inject_before_call
	File *f = fopen(filename, …);		// marked fopen (by the pragma)
	…
	fclose(f);
	char *filename2 = “path/to/file2”;
	File *f2 = fopen(filename2, …);		// non-marked fopen
	…
		fclose(f2);
	return 0;

}

In fact, I am using the inject_before_call pragma to mark some fopen calls in the code (in this example, the first  fopen call is marked). Then, for each marked fopen call, some extra expressions/statements/declarations are injected into the code before calling the marked function. For example, the above main function would be transformed as follows:

int main() {

	char *filename = “/path/to/file”;
	File *tmp_f = fopen(“/path/to/another/file”, “w+");
	fclose(tmp_f);
	File *f = fopen(filename, …);
	…
	fclose(f);
	char *filename2 = “path/to/file2”;	// codes not injected for the non-marked fopen
	File *f2 = fopen(filename2, …);
	…
		fclose(f2);
	return 0;

}

Here, because of the inject_before_call pragma, the grey code is injected into the main function before calling the marked fopen. It simply opens a new file (“/path/to/another/file”) and closes it. 
The thing about the injected code is that it should be inserted only if a fopen call is marked by a inject_before_call pragma. And if after the inject_before_call pragma no fopen calls are made, the user gets an error (the pragma should be only inserted before a fopen call).

I implemented this in 3 steps as follows:

1. detection of the marked fopen calls: I created a pragma_handler which remembers the location_t of all inject_before_call pragmas. Then using a pass (before ssa), I look for the statements/expressions that are in the next line of each remembered location. If it’s a fopen call, it is considered as a marked call and the code should be inserted before the fopen call. If it’s something other than a fopen call, an error will be generated. However, I’m not aware if there are any better ways to detect the marked calls.

Here is the simplified pass to find the marked fopen calls (generating errors not covered):

unsigned int execute(function *func) {
basic_block bb;                                                      
FOR_EACH_BB_FN (bb, func) {                                          
	for (gimple_stmt_iterator gsi = gsi_start_bb (bb); !gsi_end_p (gsi); gsi_next (&gsi)) {
 		gimple *stmt = gsi_stmt (gsi);                               
                if (gimple_is_fopen(stmt)) {                     
                        if (marked_fopen(stmt)) {                                         
                        	handle_marked_fopen(stmt);           
                        }                                                        
		}                                                            
	}                                                             
}
} 

2. create the GIMPLE representation of the code to be injected: after finding the marked fopen calls, I construct some declaration and expressions to be injected into the code as follows:

// create the strings “/path/to/another/file" and “w+"
tree another_path = build_string (20, “/path/to/another/file");
fix_string_type (another_path);   
tree mode = build_string (3, “w+\0");
fix_string_type (mode);  

// create a call to the fopen function with the created strings
tree fopen_decl = lookup_qualified_name (global_namespace, get_identifier("fopen"), 0, true, false);
gimple *new_open_call = gimple_build_call(fopen_decl, 2, another_path, mode);

// create the tmp_f declaration
f_decl = build_decl(UNKNOWN_LOCATION, VAR_DECL, get_identifier(“tmp_f"), fileptr_type_node);
pushdecl (f_decl);
rest_of_decl_compilation (f_decl, 0, 0);  

// set the lhs of the fopen call to be f_decl
gimple_call_set_lhs(new_open_call, f_decl)

// create a call to the fclose function with the tmp_f variable
tree fclose_decl = lookup_qualified_name (global_namespace, get_identifier("fclose"), 0, true, false);
gimple *new_close_call = gimple_build_call(fclose_decl, 1, f_decl);


3. add the created GIMPLE trees to the code (basic-blocks):

basic_block bb = gimple_bb(stmt);                                             
for (gimple_stmt_iterator gsi = gsi_start_bb (bb); !gsi_end_p (gsi); gsi_next (&gsi)) {       	gimple *st = gsi_stmt (gsi);                                             
        if (st == stmt) {  // the marked fopen call
		gsi_insert_before(&gsi, new_open_call, GSI_NEW_STMT);
		gsi_insert_after(&gsi, new_close_call, GSI_NEW_STMT);
		gimple_set_bb(new_open_call, bb);
		gimple_set_bb(new_close_call, bb);
		break;
	}
}

This is how I implemented the plugin. However, after compiling a sample code (like the main function above), I get segmentation fault. By defining another pass  to print the statements of the code and by executing this pass after the previous pass (that injects the code), I see correct results (i.e., the injected code is correctly generated and inserted into the right location as I intended). But when I debug the sample code, I see that only the last injected statement (fclose) is executed with NULL in the f_decl variable which causes the segmentation fault. 

I also tried to insert the pass after the “lower” pass which is executed much sooner. Then I used gimple_seq body = gimple_body (current_function_decl)to get the gimple sequence of the current function and injected the new statements into the gimple sequence same as above. Bit it didn’t work out as well.
I searched everywhere, read all the documentations I could find, and digged into the gcc code for other pragmas (i.e. omp parallel, etc.). But still I have no success in doing this correctly. Could you please point me where the problem is? 

Thanks,
M. Gholami

^ permalink raw reply	[flat|nested] only message in thread

only message in thread, other threads:[~2020-07-15  7:09 UTC | newest]

Thread overview: (only message) (download: mbox.gz / follow: Atom feed)
-- links below jump to the message on this page --
2020-07-15  7:09 GCC Plugin to insert new expressions/statements in the code Masoud Gholami

This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox;
as well as URLs for read-only IMAP folder(s) and NNTP newsgroup(s).