Changelog:

29 August 2024: fix some bugs in test.py, particularly related to the test filtering options
29 August 2024: mention that submission should use the submission site (see Canvas)
3 Sep 2024: make test.py consider too-long encodings of messages a test failure instead of just printing an error about them and saying the test passed; we will not deduct credit for these problems when grading.
6 Sep 2024: add note re: trap of choosing ambiguous message delimiters

If you downloaded the skeleton code before around 5:45pm 29 August 2024, please get an updated version of test.py here that fixes some bugs that would cause the testing script to crash and not correctly handlesome of the test filtering options. Until around 12:10pm 3 Sep 2024, the testing script would report an error when messages’ encoding was too long but not consider this a test failure. We will not deduct credit for this when grading, but the test script will correctly consider this an error.

1 Your Task

Download the supplied skeleton code at here.¹ (last updated 3 Sep 2024 12:10pm)
In sendrecv.py complete an implementation of MySender and MyReceiver.

In MySender’s send_message function, receive a message in bytes, then format a message to be sent as a sequence of bits, then call self.channel.send_bits() with those bytes.

In MyReceiver’s handle_bit_from_network function, receive a bit from the network, then call self.got_message_function() when those bytes represent a full message passed to MySender’s send_message function.

To demonstrate the interface, we supply an example implementation which is partially functional. It transmits messages by converting each byte into bits, and adding a nul byte to separate messages. This satisfies some but not all of the requirements below.

Your implementation must:
1. Be able to send messages of any length between 0 and 1023 bytes, inclusive
2. Allow messages to contain any sequence of bytes (of an appropriate length).
  
  (This means that, for example, we should be able to take some of the bits your send_message function produces and send a new message containing them without it being corrupted.)
3. If a message’s bytes are corrupted, by changing the values of bits randomly and/or adding and removing bits randomly, then:
  1. You should detect this and avoid calling got_message_function with data that does not represent a valid message at least 99.99% of the time. (You should be able to achieve this with a 2 or more-byte checksum.)
  2. Other than possibly the next few messages or couple kilobytes of messages, future uncorrupted messages should be received correctly (regardless of where the corruption took place).
4. Not use too much space to send messages:
  - Including any headers/separators, the average size of a random message of N bytes at most 120 bytes or N * 1.2 bytes, whichever is larger. (It’s okay if some pathological messages take up more space.)
5. Be your own code and use only functionality built-in to python or supplied as part of its standard library. This means that it is permissible to use the zlib module’s crc32 function and the struct module. But it is not permissible to use, for example, python-pppd
Test your implementation using test.py. If you want to run additional tests, you might find it useful to modify the list of tests in TESTS or pass the --verbose option to this script, or run only specific tests with the --only-test and/or --only-subtest options.

When grading, we may run additional tests and/or inspect your code by hand. Although the supplied tests do not intentionally omit functionality you must implement, since the supplied tests are randomized, they may get unlucky and fail to detect an implementation which is actually broken and they do not test everything for all possible message sizes. (Also, it may be possible for them to spuriously fail on a correct implementation, though I think that is unlikely.)
Briefly document in a comment or docstring near the top of your sendrecv.py the frame format you choose.
Submit your sendrecv.py by using the submission site.

2 Hints

2.1 Interpreting test output

The test will output a summary line like:
```
three-message-clean: 0 extra; 0 corrupted; 0 missing; 27 bytes in 3 messages sent with 48.0 bytes
```
In this message:
- three-message-clean is a test name from the TESTS array in test.py.
- 0 extra says that 0 is the number of messages received that were not sent
- 0 corrupted says that 0 is the number of messages received that were apparently changed
- 0 missing is the number of message sent that were not received
- 27 bytes in 3 messages means that send_message was called 3 times, and the total number bytes passed across all calls was 27;
- with 48.0 bytes means that send_bytes was called with a total of 48 bytes
The test uses Python’s difflib, which implements an algorithm similar to the diff utility. In some cases, this may get confused between corrupted and extra+missing messages.

3 Hints

You can reuse the supplied bit to byte and byte to bit conversions..
The struct modules pack or unpack to convert integers to bytes and vice-versa.
[added 6 Sep 2024 5:15pm, copies Piazza post:] Be careful about choosing ambiguous message delimters.

For example, let’s say you choose 0000 as a delimiter for messages and to escape 000 with 0001 and we try to send the message 111111100.

In this case, we’ll send 111111100 0000. There was no reason to escape anything. When we try to remove the delimiter from this, we’ll find the first 0000, and get 1111111, which is not our original message.

The problem here is that the escaping is not sufficient to handle the end-of-message edge case. We could fix this by escaping every 0 instead of every 000, but that’s pretty inefficient. A more space-efficient solution is to choose a delimiter pattern that has a distinctive transition from 0s to 1s that allows us to escape less frequently. The choice of 010 in lecture and 01111110 in the textbook (Systems Approach section 2.3) are examples of this.

If you want to work on portal or similar, you might find it useful to right-click the link, select copy link or copy URL from the drop-down menu, then run the command wget THAT_URL, where THAT_URL is the result of pasting the link/URL.↩︎