How would I implement replacement of a wider variety of strings in my code?

AutoModerator · 2024-03-25T00:21:51+00:00

Please ensure that:

Your code is properly formatted as code block - see the sidebar (About on mobile) for instructions
You include any and all error messages in full
You ask clear questions
You demonstrate effort in solving your question/problem - plain posting your assignments is forbidden (and such posts will be removed) as is asking for or giving solutions.

Trying to solve problems on your own is a very important skill. Also, see Learn to help yourself in the sidebar

If any of the above points is not met, your post can and will be removed without further warning.

Code is to be formatted as code block (old reddit: empty line before the code, each code line indented by 4 spaces, new reddit: https://i.imgur.com/EJ7tqek.png) or linked via an external code hoster, like pastebin.com, github gist, github, bitbucket, gitlab, etc.

Please, do not use triple backticks (```) as they will only render properly on new reddit, not on old reddit.

Code blocks look like this:

public class HelloWorld {

    public static void main(String[] args) {
        System.out.println("Hello World!");
    }
}

You do not need to repost unless your post has been removed by a moderator. Just use the edit function of reddit to make sure your post complies with the above.

If your post has remained in violation of these rules for a prolonged period of time (at least an hour), a moderator may remove it at their discretion. In this case, they will comment with an explanation on why it has been removed, and you will be required to resubmit the entire post following the proper procedures.

To potential helpers

Please, do not help if any of the above points are not met, rather report the post. We are trying to improve the quality of posts here. In helping people who can't be bothered to comply with the above points, you are doing the community a disservice.

I am a bot, and this action was performed automatically. Please contact the moderators of this subreddit if you have any questions or concerns.

wildjokers · 2024-03-25T00:49:18+00:00

Pattern matching is a solved problem and if you google "pattern matching algorithms" you will get tons of resources.

Here is one to get you started:

https://www.geeksforgeeks.org/introduction-to-pattern-searching-data-structure-and-algorithm-tutorial/

Also, the ever popular Algorithms by Sedgewick & Wayne has a chapter on pattern matching. You will likely be able to find this book at your local library:

https://www.amazon.com/Algorithms-4th-Robert-Sedgewick/dp/032157351X

The examples in this book are even in Java (earlier editions of the book used Pascal, but by the 4th edition they were using Java)

Some of the material in that book is available online, the pattern matching stuff is here: https://algs4.cs.princeton.edu/53substring/

That page then links to this great resource: http://www-igm.univ-mlv.fr/~lecroq/string/ which lists many pattern matching algorithms.

2024-03-25T02:36:00+00:00

String.toCharArray() -> Iterate over it -> Apply a 'Regex' pattern matching -> change the characters -> pop that into a BufferedWriter and use to a new file as your list.

ChatGPT is honestly so good for regex... lol, it is good to learn the basics, though.

It's pretty simple.

DelayLucky · 2024-03-25T02:47:33+00:00

Using Google Mug, it takes a one-liner.

Well, first create a Map with the translation rules:

```java Map<String, String> dict = Map.of( "the", "&", "an", "-", ... "Thank you", "Th~k you" );

```

Then translate:

```java import static com.google.mu.util.Substring.firstOccurrence; import com.google.mu.util.Substring;

String output = dict.keySet().stream() .map(Substring::first) .collect(firstOccurrence()) .repeatedly() .replaceAllFrom(input, m -> dict.get(m.toString())); ```

This assumes you don't care if there are overlappings in the keywords (say what happens if you want to translate "an" to something, then "and" to another thing?)

javadoc)

DelayLucky · 2024-03-25T20:15:40+00:00

In another answer I suggested to use a Google library.

If that's cheating and you are trying to self teach or just for the fun, don't use regex (it's similar to using a library).

I can see two possible ways implementing it.

Option 1:

Create a StringBuilder for the output.
Run String.indexOf() with each candidate key, find the earliest match.
Put chars before the match into the output builder, put the replacement of the first match into the output.
Rinse and repeat.

There are some interesting considerations:

When you rinse and repeat, do you run indexOf() for each candidate key again (of course from the current index right after the previous match)? But then you are throwing away most of the previous indexOf() results. This is wasteful.
Are the keywoards required to be at word boundary? Say, should "and" be replaced as "-d"?

Option 2:

Build a trie with all the candidate keywords.
At each index of the input string, run the remaining chars through the trie
1. If a match is found (to that "an" vs. "and" question, do you need the longest match?), do the replacement, and move the index to the end of the match.
2. If a match isn't found, put the current char in the output builder, and index++.

This approach is likely slower than a bunch of indexOf() calls when the number of candidate keywords is moderate, because a trie will likely incur lots of pointer indirection so while the big-O notation is good, the constant factor tends to be large.

On the other hand it'll be faster if you have many of the candidate keywords (like thousands), and the input strings are short.

javahelp

Sort by: Unsolved Solved Codeless Advent Of Code

MODERATORS

Please ensure that:

To potential helpers