# About Passwords, Birds, and Common Regexes **Challenge 287 solutions in Perl by Matthias Muth** ## Task 1: Strong Password > You are given a string, $str.
> Write a program to return the minimum number of steps required to make the given string very strong password. If it is already strong then return 0.
> Criteria:
> - It must have at least 6 characters.
> - It must contains at least one lowercase letter, at least one upper case letter and at least one digit.
> - It shouldn't contain 3 repeating characters in a row.
> > Following can be considered as one step:
> - Insert one character
> - Delete one character
> - Replace one character with another
> > Example 1
> Input: \$str = "a"
> Output: 5
>
> Example 2
> Input: \$str = "aB2"
> Output: 3
>
> Example 3
> Input: \$str = "PaaSW0rd"
> Output: 0
>
> Example 4
> Input: \$str = "Paaasw0rd"
> Output: 1
>
> Example 5
> Input: \$str = "aaaaa"
> Output: 3
#### Dealing with short passwords If the password is too short, there is no other way than inserting characters up to the required length. Example 2 is a test case for this:
`aB2` => 3 (like insert `X`, `Y`, `Z` to get `aB1XYZ` )
but also:
`""` => 6 (like `aB1cde`). #### Dealing with missing categories For every category (digits, upper case or lower case letters) that we don't find, we can choose to *insert* a new character or to *replace* an existing one. We will see later that both operations have their use, in different cases. We can use some extra test cases for this:
`abcABC` => 1 (e.g. *insert* `1` to get `abcABC1`, or *replace* `b` by `1` to get `a1cABC`)
`abcdef` => 2 (e.g. *insert* `1` and `A` to get `abcdef1A`, or *replace* `b` by `1` and `c` by `A` to get `a1Adef`)
#### Dealing with repeating characters (like 'aaaaa') The third criteria says that the password '*shouldn't contain 3 repeating characters in a row*'.
If we find a sequence that is longer, there are three ways to modify it to comply with the rule: * *Insert* a different character into the sequence after every second character.
This will split the sequence into chunks of two, separated by the new characters:
`aaaaaa => aaiaaiaa` (2 *insert*s for a sequence of 6)
`aaaaaaaaa => aaiaaiaaiaaia` (**4 *inserts*** for a sequence of 9)
* *Replace* every third character by a different one.
This also splits it into chunks of 2, but it is more efficient. Each replaced character 'consumes' one character of the sequence and thus reduces the amount of work still left to do:
`aaaaaa => aaraar` (2 replacements for a sequence of 6)
`aaaaaaaaa => aaraaraar` (**3 *replacements*** for a sequence of 9)
* *Delete* all characters in excess of the first 2.
Deleting characters is a dangerous thing, and it is inefficient:
Dangerous, because it also reduces the overall length of the password, and in the end we might be violating the 'at least 6 characters' criteria.
Inefficient, because each *deletion* only takes care of exactly one excess character, whereas at the same time, *replacing* one character at the right spot 'neutralizes' three of them:
`aaaaaa => aa` (4 *deletions* for a sequence of 6)
`aaaaaaaaa => aa` (**7 *deletions*** for a sequence of 9)
So we better do not consider 'delete' operations at all. Some more examples: | sequence length | example sequence | *insert* result | *replace* result | *insert* operations needed | *replace* operations needed | | :-------------: | :--------------- | :------------------ | :--------------- | :------------------------: | :-------------------------: | | 6 | `aaaaaa` | `aabaabaa` | `aabaab` | 2 | 2 | | 7 | `aaaaaaa` | `aabaabaaba` | `aabaaba` | 3 | **2** | | 9 | `aaaaaaaaa` | `aabaabaabaaba` | `aabaabaab` | 4 | **3** | | 12 | `aaaaaaaaaaaa` | `aabaabaabaabaabaa` | `aaXaaXaaXaaX` | 5 | **4** | These examples show that for dealing with longer sequences, *replacing* characters is more efficient than *inserting* characters.
This leads us to some more test cases:
`aaa1B` => 1 (e.g. *inserting* one `b` to get `aaba1B`)
`aaaa1B` => 1 (e.g. *replacing* one `a` by `b` to get `aaba1B`)
`aaaaa1B` => 1 (e.g. *replacing* one `a` by `b` to get `aabaa1B`)
`aaaaaa1B` => 2 (e.g. *replacing* 2 times `a` by `b` to get `aabaab1B`)
`aaaaaaa1B` => 2 (e.g. *replacing* 2 times `a` by `b` to get `aabaaba1B`)
`aaaaaaaa1B` => 2 (e.g. *replacing* 2 times `a` by `b` to get `aabaabaa1B`)
`aaaaaaaaaaaa1B` => 4 (e.g. *replacing* 4 times `a` by `b` to get `aabaabaabaab1B`) #### Up to three birds with one shot! It turns out that if we are lucky, by *inserting* one character we can solve three problems at the same time: * make the password longer, * add a missing category by choosing the character to be from that category, * shorten a long repeated-character sequence by inserting the character to split off two characters of the sequence. In this example, one operation solves all three shortcomings, which makes it another good test case:
`aaaBc` => 1 (e.g. by *inserting * one digit at the right place, like `aa1aBc`).
#### And another two birds with another shot... Similarly, we also can solve *two* problems at the same time by *replacing* one character: * add a missing category by choosing the character to be from that category, * shorten a long repeating character sequence by choosing a 'third' character to be replaced, which will 'neutralize' the two characters preceding it. For example (and yet another test case):
`aaaabC` => `aa1abC` => 1 change only for solving two shortcomings. #### Solution structure For our solution this means that we can proceed as follows: 1. We determine how many *inserts* are needed for reaching the **minimum password length**.
Every *insert* is counted as a cost for our final result (the number of operations needed).
In addition, we keep a separate counter for the inserted characters, which we will decrement when we use them 'for more birds' (to add categories or to split up long sequences). 2. For each **missing category**, we first use one of the characters inserted in step 1 for adding that category.
This causes *no additional cost*, as long as we have any of those characters left. 3. If we *still* are missing categories, we *replace* existing characters of another category by a character of the respective missing category.
Every character to be replaced is counted as a cost.
We don't care to choose *which* characters to be replaced here, because we might choose good positions later, when we try to split long repeating sequences.
For this, we also keep another counter for the replaced characters.
The reason why we prefer replacing to inserting here is that later on, *replaced* characters might serve better than *inserted* ones. 4. For every **long repeating character sequence** that we find, we do this: Position any *replacement* characters from the previous step on every third character of the sequence, each one 'neutralizing' *three* characters of the repetition, until we have used all the *replacement* characters, or the sequence is fully neutralized.
There's *no additional cost* for this. If there are no *replaced* characters available any more, position any *inserted* characters from step 1 after every second character of the sequence, neutralizing two characters of the sequence.
Even if using the *inserted* characters is less efficient than using replacements, these here come for free, because we've paid for them already. Now they can be used with *no additional cost*. For the rest of the sequence, we *replace* every third characters within the sequence to split it up further.
These replacements count for the cost, though. 5. We return the cost that we have determined. Here are some more 'combination' test cases:
`aaaacccc` => 2 (like `aa1accBc) (replacing one `a` by a digit,
and one `c` by an upper case character.
and two `b`s by anything else but a `b`).
`aaaaaabbbbbb` => 4 (like `aa1aaXbbYbbY`),
(replacing one `a` by a digit,
another `a` by an upper case character,
and two `b`s by anything else but a `b`).
`aaacc` => 2 (e.g. `aa1ccX`).
And this is my solution: ```perl use v5.36; use List::Util qw( sum min ); use constant MIN_PASSWORD_LENGTH => 6; sub strong_password( $str ) { # Make pattern matches easier to write. $_ = $str; my ( $cost, $available_inserted, $available_replaced ) = ( 0, 0, 0 ); # 1: Do *inserts* for bringing the password up to length. if ( length() < MIN_PASSWORD_LENGTH ) { $available_inserted = MIN_PASSWORD_LENGTH - length(); $cost += $available_inserted; } # 2: Use the inserted characters to add missing categories # (*no additional cost!*), my $n_missing_categories = sum( ! /\d/, ! /[a-z]/, ! /[A-Z]/ ); if ( $n_missing_categories && $available_inserted ) { my $n_to_use = min( $n_missing_categories, $available_inserted ); $n_missing_categories -= $n_to_use; } # 3: If there still are categories missing, # we *replace* existing characters. if ( $n_missing_categories ) { $available_replaced = $n_missing_categories; $cost += $available_replaced; } # 4: Deal with long repeating sequences (3 or more same characters). while ( /(.)\1\1\1*/g ) { my $sequence_length = length( $& ); while ( $sequence_length > 2 && $available_replaced ) { $sequence_length -= 3; --$available_replaced; } while ( $sequence_length > 2 && $available_inserted ) { $sequence_length -= 2; --$available_inserted; } while ( $sequence_length > 2 ) { $sequence_length -= 3; ++$cost; } } return $cost; } ``` The `ch-1.pl` file also contains switchable debugging output, and all the test cases. This is its output: ```terminal ok 1 - Example 1: strong_password( 'a' ) == 5 ok 2 - Example 2: strong_password( 'aB2' ) == 3 ok 3 - Example 3: strong_password( 'PaaSW0rd' ) == 0 ok 4 - Example 4: strong_password( 'Paaasw0rd' ) == 1 ok 5 - Example 5: strong_password( 'aaaaa' ) == 2 (like 'aa1aaB') ok 6 - Extra 1: strong_password( '' ) == 6 (like '1aBcde') ok 7 - Extra 2: strong_password( 'abcABC' ) == 1 (like 'a1cABC') ok 8 - Extra 3: strong_password( 'abcdef' ) == 2 (like 'a1Adef') ok 9 - Extra 4: strong_password( 'aaa1B' ) == 1 (like 'aaba1B' using one insert) ok 10 - Extra 5: strong_password( 'aaaa1B' ) == 1 (like 'aaba1B' using one replace) ok 11 - Extra 6: strong_password( 'aaaaa1B' ) == 1 (like 'aabaa1B') ok 12 - Extra 7: strong_password( 'aaaaaa1B' ) == 2 (like 'aabaab1B') ok 13 - Extra 8: strong_password( 'aaaaaaa1B' ) == 2 (like 'aabaaba1B') ok 14 - Extra 9: strong_password( 'aaaaaaaa1B' ) == 2 (like 'aabaabaa1B') ok 15 - Extra 10: strong_password( 'aaaaaaaaaaaa1B' ) == 4 (like 'aabaabaabaab1B') ok 16 - Extra 11: strong_password( 'aaabC' ) == 1 (like 'aa1abC' using one insert) ok 17 - Extra 12: strong_password( 'aaaabC' ) == 1 (like 'aa1abC') ok 18 - Extra 13: strong_password( 'aaaacccc' ) == 2 (like 'aa1accBc') ok 19 - Extra 14: strong_password( 'aaaaaabbbbbb' ) == 4 (like 'aa1aaXbbYbbY') ok 20 - Extra 15: strong_password( 'aaacc' ) == 2 (like 'aa1ccX') 1..20 ``` It has never been easier to create good passwords! :wink::joy: ## Task 2: Valid Number > You are given a string, $str.
> Write a script to find if it is a valid number.
> Conditions for a valid number:
> > - An integer number followed by an optional exponent.
> - A decimal number followed by an optional exponent.
> - An integer number is defined with an optional sign '-' or '+' followed by digits.
>
> Decimal Number:
> A decimal number is defined with an optional sign '-' or '+' followed by one of the following definitions:
> - Digits followed by a dot '.'.
> - Digits followed by a dot '.' followed by digits.
> - A dot '.' followed by digits.
>
> Exponent:
> An exponent is defined with an exponent notation 'e' or 'E' followed by an integer number.
>
>
> Example 1
> Input: $str = "1"
> Output: true
>
> Example 2
> Input: $str = "a"
> Output: false
>
> Example 3
> Input: $str = "."
> Output: false
>
> Example 4
> Input: $str = "1.2e4.2"
> Output: false
>
> Example 5
> Input: $str = "-1."
> Output: true
>
> Example 6
> Input: $str = "+1E-8"
> Output: true
>
> Example 7
> Input: $str = ".44"
> Output: true
Probably the easiest way to solve this is to use the `Regexp::Common` CPAN module: ```perl use v5.36; use Regexp::Common; sub valid_number( $str ) { return $str =~ /^$RE{num}{real}$/; } ``` This works well for all examples. If you don't want to use a module, this regular expression might be used instead: ```perl use v5.36; sub valid_number( $str ) { return $str =~ /^ [+-]? (?: \.\d+ | \d+(?:\.\d*)? ) (?: [Ee] [+-]? \d+ )? $/xa; } ``` Nothing really special about it, except maybe the `a` modifier, which is used to make sure that no other Unicode characters with a 'digit' property match `\d`. Basically this makes `\d` equivalent to `[0-9]`, which is the safer most of the time. #### **Thank you for the challenge!**