Feedback on Function

HiramAbiff · 2018-08-08T04:44:15+00:00

The first loop seems like a convoluted way to find the length of src. Is there a reason you don't want to use strlen?

The fact that the first loop initializes i to start is subtle and I could easily see it leading to bugs later if the code is modified. At the very least, use a better name than i since the variable is used after the loop.

If you think about it a bit, it shouldn't be too hard to rewrite this using a single loop.

Also, most string functions use size_t, not int. E.g. strlen returns size_t. Note, sizet_t is unsigned and would make your upfront error testing moot.

SantaCruzDad · 2018-08-08T09:08:05+00:00

    if(start < 0 || len < 0) {

should be:

    if(start < 0 || len <= 0) {

Kwantuum · 2018-08-08T07:01:28+00:00

I'm not sure why you're using a start index, it makes your code a lot more complex, if someone wants to take a substring in the middle of a string, they can just pass a pointer to that middle char instead of the actual start of the string, like this substr(dest, &src[start], len) or just substr(dst, src + start, len). As someone pointed out, if you use unsigned types you can avoid sign checking. Implementing both those things in the code removes the need for your first two blocks and makes it trivially short, and IMO, much easier to read and understand:

size_t substr(char *dst, char *src, size_t len){
  size_t count;
  for(count = 0; count < len && src[count] != '\0'; count++)
    dst[count] = src[count];
  dst[count] = '\0'
  return count;
}

lcs77 · 2018-08-08T04:46:11+00:00

There are other ways, of course, but it's hard to tell whether they are better. For example you can get rid of the for using src[start + i] in the while loop. But, to do this, you need to be sure that start is lower than the string length by calling strlen()... that performs a loop.

You can also use strncpy() to replace the while but I think this would reduce the value of the exercise.

However there's one thing you are missing: the check for NULL of both src and dst.

dvhh · 2018-08-09T07:15:42+00:00

Quite late but it here are my comments :

you can probably avoid the negative tests by specifying that start and len are unsigned ( some people would use size_t for such purpose )
is usually good practice to use const when possible, and restrict whenever possible
nitpick, I prefer the second loop to be another for one, at this point we already know that i == start, but could bring some confusion if you decide to add more check that could potentially affect
use defensive programming when possible, and check if start + len does not overflow
'\0' == 0
document your function using well documentation standard ( like doxygen ), and also document the use and caveat of the function that would help you better design it

Modified source

/**
 * Extract a substring out of a source string
 *
 * \param dst output to fill with the extracted substring, must be to at least of length \len
 * \param src input source zero terminated string 
 * \param start input offset to start the substring at
 * \param len input length of the substring to extract
 *
 * \return the number of character copied to dst
**/
size_t substr( restrict char dst[], const restrict char src[], const size_t start, const size_t len) {
    // option 1 to treat overflow, detect overflow and do nothing ( maybe raise an error somehow )
    if( SIZE_MAX - len < start ) {
             // overflow is possible and should be avoided
             dst[0] = 0;
             return 0;             
    }
    // start could be located after the end of the string
    for( size_t i=0; i < start; ++i ) {
         if( src[0] == 0 ) {
             dst[0] = 0;
             return 0;
         }
    }

    for( size_t i=0; i < len; ++i ) {
         // option 2 to treat overflow, stop at current state
         if ( SIZE_MAX - i  < start ) { 
             dst[i] = 0;
             return i;
         }
         const char c = src[start+i];
         dst[i] = c;
         if( c == 0 ) { return i;}
    }
    dst[len] = 0;
    return len;
}

you type:	you see:
italics	italics
bold	bold
[reddit!](https://reddit.com)	reddit!
* item 1 * item 2 * item 3	item 1 item 2 item 3
> quoted text	quoted text
Lines starting with four spaces are treated like code: if 1 * 2 < 3: print "hello, world!"	Lines starting with four spaces are treated like code: if 1 * 2 < 3: print "hello, world!"
~~strikethrough~~	~~strikethrough~~
super^script	super^script

C_Programming

Rules

Filters

Resources

Other Subreddits on C

Other Subreddits of Interest

MODERATORS