Abstract:
We propose a clustering-based filtering method for similar source code fragment search.
This method groups code fragments by clustering and excludes from the output those fragments with a
low probability of being correct answers. We confirm the usefulness of the proposed method using an
existing dataset of similar code fragments that contain bugs.